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• ■ ABSTRACT 

We present the Data Release 9 Quasar (DR9Q) catalog from the Baryon Oscillation Spectroscopic Survey (BOSS) of the Sloan Digital Sky Survey 
q ■ III. The catalog includes all BOSS objects that were targeted as quasar candidates during the survey, are spectrocopically confirmed as quasars via 

■ visual inspection, have luminosities Mj[z=2] < -20.5 (in a ACDM cosmology with H = 70 km s -1 Mpc~', Ci M = 0.3, and fl A = 0.7) and either 
1 display at least one emission line with full width at half maximum (FWHM) larger than 500 km s" 1 or, if not, have interesting/complex absorption 

^ features. It includes as well, known quasars (mostly from SDSS-I and II) that were reobserved by BOSS. This catalog contains 87,822 quasars 
i i. (78,086 are new discoveries) detected over 3,275 deg 2 with robust identification and redshift measured by a combination of principal component 
eigenspectra newly derived from a training set of 8,632 spectra from SDSS-DR7. The number of quasars with z > 2.15 (61,931) is ~2.8 times 
' larger than the number of z > 2.15 quasars previously known. Redshifts and FWHMs are provided for the strongest emission lines (C iv, C m], 
^ Mg n). The catalog identifies 7,533 broad absorption line quasars and gives their characteristics. For each object the catalog presents five-band 
(u, g, r, i, z) CCD-based photometry with typical accuracy of 0.03 mag, and information on the morphology and selection method. The catalog 
\Q ■ also contains X-ray, ultraviolet, near-infrared, and radio emission properties of the quasars, when available, from other large-area surveys. The 
' calibrated digital spectra cover the wavelength region 3,600-10,500 A at a spectral resolution in the range 1,300 < R < 2,500; the spectra can be 
retrieved from the SDSS Catalog Archive Server. We also provide a supplemental list of an additional 949 quasars that have been identified, among 

■ galaxy targets of the BOSS or among quasar targets after DR9 was frozen. 

- Key words. Keywords: catalogs, surveys, quasars: general 
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1. Introduction iDawson etai1l2012l) . BOSS is the main dark time legacy survey 

, . ,. I-?-, — m rrTTT^ • of the third stage of the Sloan Digital Sky Survey (SDSS-III, 

Since their discovery d Schmidt| ll263|), interest in quasars Eisensteinet a i/ 20 ll). It is based on the ninth data release of 

g ■ has grown steadily, both because of their peculiar properties the SDSS dAhn etllBmh . BOSS is a five-year program to ob- 

and because of their importance for cosmology and galaxy ^ a of , 5 of laxies and over 150 , 00z > 2.15 

evolution. Many catalogs have gathered together increasing s The main j of ±e & h tQ detect ^ characteris . 

numbers of quasar s en^er from heterogeneous sarnies (see & scale . inted b b acoustic osci i lations (B AO) in the 

i Hewitt&Burbxdg^ [Hp | Veron-Cetty & Vero^ £o5l and ear , uniyer P se from ^ ^ distribution of both mminous red 

references therein) or from large surveys, most impor tantly: galaxies at z „ QJ and H , absorption lines in the inter galactic 



— ' 1 ; 1 — 1 1 * I yaiaAics ill z ~ u.i anu n i ausuiuuun lines in liic iniciyaiacuc 

the Large B mM Quasar Survey (LBQS iMoms et al. | U^Ell medium (IGM) at z „ 2 .5. BOSS uses the same imaging data as 

Hew ettet alj | l995|); the 2dF Quas ar Redshift Survey 2QZ; fa SDSS _ T and n ^ an extension m the South Galactic c 

Boyle et al. 2000; Croom et al. 2001) and the successive releases (crr\ 



J3UV1C CI tll.lZ,ULIUI.H^100m CI <U.IZ,UU1I) CU1U L11C SULLCbblVC IClCUbCb (QPC^ 

of the Sloan D igital Sky Survey (SDS S. lYork et al. 2000) Quasar 

Catalogs (e.g JSchneider et alj|2010i forDR7). The ba O clustering measurements in the IGM require a 

This paper describes the first quasar c atalog of the Baryon quasar catalog of maximal purity and accurate redshifts. Indeed 

Oscillation Spectroscopic Survey (BOSS, ISchlegel et al] 120071; the spectra of any non-quasar object, especially at high signal-to- 

noise ratio, will dilute the signal and/or increase the noise in the 

* Hubble fellow clustering measurement. The automated processing of the spec- 
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tra dBolton et al-ll2QT2h is sophisticated, but is not perfect. The 
identification of the objects and their redshifts have therefore to 
be certified before any analysis is performed. The present cata- 
log, henceforth denoted DR9Q catalog, contains 87,822 quasars 
identified among the objects targeted as quasar candidates over 
an area of 3,275 deg 2 surveyed during the first two years of 
BOSS operations. We also give a supplemental list of quasars 
identified among galaxy targets. This catalog keeps the tradi- 
tion of producing quasar catalogs (Schnei der et alj|20 02. 2003, 
120051 120071 12010) from SDSS-I and II dYork et alJ200Ch. The fi- 
nal version of the SDSS-II quasar catalog ( Schneider et al. 2010) 
based on the seventh SDSS data release ( Abazaiian et al. 2009) 
contains 105,783 objects mostly at z < 2 (see IShen et aLll201 ll 
for their properties). Note that the DR9Q catalog does not con- 
tain all DR7 quasars but only those DR7 quasars that were reob- 
served during the two first years of BOSSj- High redshift (z > 2) 
quasar continua together with pixel masks, improved noise esti- 
mates, and other products designed to aid in the BAO-Lyman-q 
clustering analysis will be released in lLee et all (12012. in prep.l) . 

The selection of candidates and observations are summa- 
rized in Section [2] We describe the visual inspection of all tar- 
gets in Section [3] present accurate redshifts for the quasars in 
Section [4] and describe the detection and measurement of broad 
absorption lines (BALs) in Section[5] The catalog is described in 
Section[6] We give a catalog summary in SectionQand comment 
on the supplemental lists of quasars in Section [8] We conclude 
in Section[9] 

In the following we will use a ACDM co smology with Hp = 
70 km s" 1 Mpc -1 , Q M = 0.3, and O a = 0.7 dSpergelet al.ll2003l) . 

Most of the objects in the catalog show at least an emis- 
sion line with FWHM > 500 km s _1 in their spectra. However, 
there are a few exceptions: a few objects have emission lines 
with smaller FWHM due to noise or dust obscuration (Type II 
quasars) others have very weak emission lines but are identi- 
fied as quasars because of th e presence of the Lyman-a forest 
(Dia mond-Stanic et a ll 120091) . We will call a quasar an object 
with a luminosity M;[z=2] < -20.5 and either displaying at least 
one emission line with FWHM greater than 500 km s _1 or, if 
not, having interesting/complex absorption features. This defini- 
tion is slightly different from the one used in SDSS-DR7. The 
change in absolute magnitude is to include a few low-z objects 
in the catalog. Because BOSS is targeting z > 2.15 quasars, the 
median absolute luminosity is higher in BOSS than in SDSS- 
DR7. All BOSS objects with z > 2 qualify for the SDSS-DR7 
definition: FWHM > 1000 km s 1 and M;[z=0] < -22 (adopt- 
ing the same cosmology and a Y = -0.5). In the following, all 
magnitudes will be PSF magnitudes. 



2. Survey outline 

In order to measure the BAO scale in the Lyman-a forest at 
z ~ 2.5, BOSS aims to obtain spectra of over 150,000 quasars 
in the redshift range 2.15 < z < 3.5, where at least part of the 
Lyman-a forest lies in the BOSS spectral range. The measure- 
ment of clustering in the IGM is independent of the properties of 
background quasars. Therefore the quasar sample does not need 
to be uniform and a variety of selection methods are used to in- 
creas e the surface density of high redshift quasars (Ross et al] 
120121) . Some quasars with z < 2 will be targeted in the course 



1 All known z > 2.15 quasars in BOSS footprint are being reobserved 
to obtain spectra of uniformly high SNR in the Lyman-a forest and to 
enable variability studies 



of specific ancillary science programs or as a consequence of 
imperfect high-redshift quasar selection. 

To detect the BAO signal, a surface density of 15 
quasars with z > 2.15 p er square degree is required 
(iMcDonald & Eisensteinll2007l) . For comparison, SDSS-I/II tar- 
geted about ~ 14, 000 z > 2.15 quasar s over the full survey, 
e.g. -8,400 deg 2 dSchneider et al.l l2010h . leading to a surface 
density of -2 quasars per square degree in the redshift range 
of interest for BOSS. To reach the BAO quasar density require- 
ment implies targeting to fainter magnitudes than SDSS-I/II. The 
BOSS l imiting magnitud e for target selection is r < 21.85 or 
g < 22 dRoss et al.ll2012l) . while z > 3 quasars were select ed to 
be brighter than ~ 20.2 in SDSS-I/II dRichards et alJl2002h . 

2.1. Imaging data 

BOSS uses the same imaging data as that of the original SDSS- 
I/II survey, with an extension in the SGC. These data were gath- 
ered using a dedicated 2.5 m wide-field telescope dGunn et al.l 
I2006I) to collect light for a camera with 30 2kx2k CCDs 
dGunn et al.ll 1998b over five broad bands - ugriz dFukugita et al.l 
1996); this camera has imaged 14,555 unique square degrees 
of the sky, includi ng -7,500 deg 2 in the NGC and -3,100 
deg 2 in the SGC (Aih ara et all 1201 ll) . The imagin g data were 
taken on dark photometric nights of good seeing (Hogg et al. 
2001). Ob jects were detected and their prop erties were measured 
dLupton et aL|l2001b IStoughton et al.ll2002l) and calibrated pho- 
tometrically Jsmifh et al.l 2002t llvezic et al.ll2004t iTucker et al 1 
2006; Pa dmanabhan et afll2008l) . and astrometrically dPier et al 
120031 



2.2. Target selection 

The target selection of quasar candidates is crucial for the goals 
of the quasar BOSS survey. On average 40 fibers per square de- 
gree are allocated by the survey to the quasar project. The sur- 
face density of z > 2.15 quasars t o the BOSS magnitude limit 
is app roximately 28 per deg 2 (see iPalanque-De labrouille et all 
2012). Thus, recovering these quasars from 40 targets per square 
degree in single-epoch SDSS imaging is challenging because 
photometric errors are significant at this depth and becaus e the 
quasar locus (in ugriz) crosses the stellar lo cus at z - 2.7 (iFanl 
1999: lRichards et al.ll2002l: iRoss et al.ll2012l) . All objects classi- 
fied as point-sources in the imaging data and brighter than ei- 
ther r = 21.85 or g = 22 (or both, magnitudes dereddened for 
Galactic extinction) are passed through the various quasar target 
selection algorithms. The quasar target select ion for the first tw o 
years of BOSS operation is fully described in lRoss et al. l d20L2l) . 
We briefly summarize here the key steps. 

The target selection algorithm is designed to maximize the 
number of quasars useful for the Lyman-a forest analyses and 
reach the requirement of 15 deg~ 2 quasars with z > 2. 15. Several 
target selection methods are therefore combined and data in 
other wavelength bands are used when available. At the same 
time, in order to use the quasars themselves for statistical stud- 
ies, s uch as the quasar l uminosity function or clustering analyses 
(e.g. IWhite et alj|2012l) . part of the sample must be uniformly 
selected. Thus, the BOSS quasar target selection is split in two 
parts: 

- About half of the targets are selected as part of the so-called 
"CORE" sample using a single uniform target selection algo- 
rithm. The likelihood method (Kirkpa trick et alj|201 lb was 
adopted for the CORE selection during the first year of ob- 
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servations. Starting with the second year of operation, it was 
replaced by the extreme deconvolution method (XDQSO; 
Bovy etal. 201 ]]) which better takes photometric errors into 
account. 

Most of the remaining quasar candidates are selected as part 
of the so-called "BONUS" sample through a combination of 
four methods: the Non-Parametric Ba yesian Classification 
and K ernel Density Estimator (KDE; Richards et al.l 12004 
2009), the like lihood method jKrrkpatricketalJl2011h . a 
neural network dYeche et alJl2010h and the XDQSO method 
dBovv et ail 1201 ll 20121 objects for lower likelihood than 
in the CORE sample, over a slightly ex panded redshift 
range, and incorporating data from UK I DSS; Lawrence et al.l 
(120071) : from GALEX; iMartin et al.l (|2005|); and, where 
available, from coadded imaging in overlapping SDSS runs). 
The outputs of all of these BONUS methods are combined 
using a neural network 



Point-sources that match FIRST dBecker et al.l 119951) and 
that are not blue in u - g (which would be characteristic of 
Z < 2 quasars) are also always included. In addition, previ- 
ously known z > 2.15 quasars (mostly from SDSS I/II) were 
also re-targeted for several reasons: (i) the BOSS wavelength 
range is more extended than in previous surveys; (ii) BOSS 
spectra have u sually high er signal-to-noise ratio (SNR) than 
SDSS spectra (lAhn et all 120121) : (iii) the two epoch data will 
allow spectral variability studies. This sample is selecte d us- 
ing the SDSS-DR7 quasar catalo g (ISchneider et ail|2010l) . the 
2dF QSO Redshift Survey (2QZ: ICroom et al.ll2004l). "the 2dF - 
SDSS LRG and QSO Survey (2SLAO: ICroom et al.l 120091 
the AAT-UKIDSS-SDSS (AU S) survey, and th e MMT-BOSS 
pilot survey (Appendix C in iRoss et all 120121) . Quasars ob- 
served at high spectral resolution by UVES-VLT and HIRES- 
Keck were also included in the sample. Finally, BOSS in- 
cludes targeting of a number of ancillary programs, some de- 
signed specifically to target quasars (e.g., the variability pro - 
grams; lPalanque-Delabrouille et al.l201 ltlMacLeod et al.l2012l) . 
The corresponding programs include: 

• Reddened Quasars: Quasar candidates with high intrinsic red- 
dening. 

• No Quasar Left Behind: Bright variable quasars on Stripe 82. 

• Variability-Selected Quasars: Variable quasars on Stripe82, fo- 
cused on z > 2.15. 

• K-band Limited Sample of Quasars: Quasars selected from 
SDSS and UKIDSS K photometry. 

• High-Energy Blazars and Optical Counterpars of Gamma-Ray 
Sources: Fermi sources, plus blazar candidates from radio and 
X-ray. 

• Remarkable X-ray Source Populations: XMM-Newton and 
Chandra sources with optical counterparts. 

• BAL Quasar Variability Survey: Known BALs from SDSS- 
I/II. 

• Variable Quasar Absorption: Known Narrow-line absorption 
quasars from SDSS-I/II. 

• Double-Lobed Radio Quasars: Point sources lying between 
pairs of FIRST radio sources. 

• High-Redshift Quasars: Candidates at z > 3.5 in overlap be- 
tween scanlines. 

• High-Redshift Quasars from SDSS and UKIDSS: Candidates 
at z > 5.5 from SDSS and UKIDSS photometry. 

• Previously Known Quasars with 1.8 < z < 2.15: Reobserved 
to constrain metal absorption in the Lya forest. 

• Variable Quasars: selected from repeat observations in over- 
laps of SDSS imaging runs. 



These programs are described in det ail in the Appendix and 
Tables 6 and 7 of iDawson etail (120121) . 



2.3. Spectroscopy 

Because BOSS was designed to observe targets two magnitudes 
fainter than the original SDSS spectroscopic targets, substan- 
tial upgrades to the SDSS spectrograph s were required an d pre- 
pared during the first year of SDSS-III (ISmee et al.ll2012l) . New 
CCDs were installed in both red and blue arms, with much 
higher quantum efficiencies both at the reddest and bluest wave- 
lengths. These are larger format CCDs with smaller pixels, that 
match the upgrade of the fiber system from 640 fibers with 3 
arcsec optical diameter to 1,000 fibers (500 per spectrograph) 
with 2 arcsec diameter. The larger number of fibers alone im- 
proves survey efficiency by 50%, and because BOSS observes 
point sources (quasar targets) and distant galaxies in the sky- 
dominated regime the smaller fibers yield somewhat higher SNR 
spectra in typical APO seeing, though they place stiffer de- 
mands on guiding accuracy and differential refraction. The orig- 
inal diffraction gratings were replaced with higher throughput, 
volume-phase holographic (VPH) transmission gratings, and 
other optical elements were also replaced or recoated to im- 
prove throughput. The spectral resolution varies from ~ 1,300 at 
3,600 A to 2,500 at 10,00 A The instrument is described in 
detail in [ Smee et a"D d2012l) and the BOSS survey is explained in 
' 0121). 



IDawson et al.ld20K 

BOSS spectroscopic observations are taken in a series of at 
least three 15-minute exposures. Additional exposures are taken 
until the squared signal-to-noise ratio per pixel, (SNR) 2 , reaches 
the survey-quality threshold for each CCD. These thresholds 
are (SNR) 2 > 22 at /-band magnitude 21 for the red camera 
and (SNR) 2 > 10 at g-band magnitude 22 for the blue cam- 
era (extinction corrected magnitudes). Recall that the pixels are 
co-added, linear in log A with sampling from 0.82 to 2.39 A 
over the wavelength range from 3,610 to 10,140 A. The cur- 
rent spect roscopic reduction p ipeline for BOSS spectra is de- 
scribed in lBolton etaIld2012li . SDSS-III uses pl ates with 1000 
spect ra each, more than one plate can cover a tile dDawson et al.l 
2012). 819 plates were observed between December 2009 and 
July 2011. Some have been observed multiple times. In total, 
87,822 unique quasars have been spectroscopically confirmed 
based on our visual inspection. Fig.[T]shows the observed area in 
the sky. The total area covered by the SDSS-DR9 is 3,275 deg 2 . 
Fig. [2] displays the cumulative number of quasars as a function 
of the observation date. 




Fig. 1. The space distribution in equatorial coordinates of the 
SDSS-III DR9 data release quasars. 
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Jan 1st Jul 1st Jan 1st Jul 1st 

2010 2010 2011 2011 

Date 

Fig. 2. Cumulative number of quasars as a function of obser- 
vation date during the first two years of the survey. Horizontal 
times are due to the yearly summer shutdown during monsoon 
rains (summer 2010 at MJD = 55400) and the monthly bright 
time when BOSS does not observe. 



As z > 2 quasars are usually identified by the presence of 
strong Lyman-o' and C iv emission lines, we determine the SNR 
effectively achieved at the position of these lines. The median 
SNR per pixel at the position of various emission lines (Lyman- 
a, C iv, C in] complex and Mg n) and in the continuum are shown 
in Fig. |3] While the SNR per pixel in regions free of emission 
lines (black histogram) drops to be equal to ~ 1 at r ~ 22, the 
SNR at the top of the Lyman-a (green histogram) and C iv (red 
histogram) emission lines stays above about 4, allowing the iden- 
tification of a fair fraction of these objects at this magnitude. 

In order to classify the object, each spectrum is fit by the 
BOSS pipelinq^ with a library of star templates, a PCA decom- 
position of galaxy spectra and a PCA decomposition of quasar 
spectra. Each class of templates is fit over a range of redshifts: 
galaxies from z = -0.01 to 1 .00 quasars from z = 0.0033 to 7.00; 
and stars from z = -0.004 to 0.004 (+1200 km/s). The combina- 
tion of redshift and template with the overall best fit (in terms of 
the lowest reduced chi-squared) is adopted as the pipeline classi- 
fication (CLASS) and redshift measurement (Z ± Z_ERR). A warn- 
ing bitmask (ZWARNING) is set to indicate poor wavelength cov- 
erage, negative star template fits, broken/dropped fibers, fibers 
assigned to mesure sky background, and fits which are within 
A^ 2 /dof = 0.01 of the next best fit (comparing only fits with 
a velocity difference of 1,000 km s _1 ). A ZWARNING equals to 
zero indic ates a robust classification with no pi peline-identified 
problem (lAihara et al.ll201 UlBolton et al.ll2012l) . 

The classifications by the BOSS pipeline are not perfect 
however and visual inspection is required. Most misclassified 
spectra have low SNR. At SNR per pixel ~ 2, some objects are 
fit equally well by a star and a quasar template. Even if the ob- 
ject is correctly identified as a quasar, the redshift can be erro- 
neous, because one line is misidentified; the most common case 
is Mg H/12800 is misidentified as Lyman-Q'. But this can be also 
because of a strong absorption feature (e.g. a damped Lyman-Q' 



2 The software used is called idlspec2d and is publicly avail- 
able. The current version is v 5_4_45. Deta i ls ca n be found at 
http://www.sdss3.org/dr9/software iBolton et alj l l2012h 




18 20 22 



r-PSF magnitude 

Fig. 3. Median observed SNR per pixel at the top of the Lyman- 
a (green), C iv (red), C m] complex (blue), Mg n at z > 2 
(orange) and Mg n at z < 2 (yellow) emission lines and in 
emission-line free regions (black) versus r-PSF magnitude (cor- 
rected for Galactic extinction). Two redshift ranges are consid- 
ered for Mg ii because the emission line is redshifted in regions 
of the spectra with very different characteristics. At r ~ 22, the 
median SNR per pixel at the top of the Lyman-Q' and C iv emis- 
sion lines is about 4; sufficient to identify most of the quasars. 
Outside of the emission-line regions, at the same magnitude, the 
SNR per pixel is about unity. 

system, DLA, or a BAL) spoils the profile of an emission line 
and the pipeline is unable to recover it. 

2.4. Calibration warnings 

2.4.1. Excess flux in the blue 

The BOSS spectra often show excess light at the blue end (a 
similar problem was found in SDSS-DR7 spectra; Paris et al., 
2011). 

To quantify this problem we selected spectra where a 
damped Lyman-Q' system (DLA) is observed with aborp- 
tion redshift greater than 3.385 and with a column density 
N(H i) > lO 205 cirT 2 . There are 402 such quasars in the sam- 
ple. In these spectra, and because of the presence of the DLA, 
the flux is expected to be zero at /t b s < 4100 A (e.g. below the 
Lyman limit of all DLAs). When stacking the selected lines of 
sight (Fig. 0J, we note instead that the flux increases for wave- 
lengths below 4000 A. The excess light at A obs ~ 3600 A is 10% 
of the flux at A KSt = 1280 A where the spectra are normalized. 
This problem can affect the analysis of the Lyman-Q' forest (see 
e.g. iFont-Ribera et al.ll2012|) and is probably a consequence of 
imperfect sky subtraction dDawson et al.|[2012l) . It will be cor- 
rected in a future version of the pipeline. 

2.4.2. Spectrophotometric calibration 

To maximize the flux in the blue part of the quasar spectra, 
where the Lyman-a forest lies, it was decided to offset the po- 
sition of the quasar target fibers to compen sate for atmospheri c 
refraction and different focus in the blue (Daws on" et al.ll2.012h . 
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Wavelength (A) 

Fig. 4. Stack of DR9 BOSS spectra where a damped Lyman- 
a system is seen at an absorpstion redshift higher than 3.385 
with a column density A^(H i) > 10 205 crrT 2 . The spectra are 
normalized to unity near 1280 A in the quasar rest frame. Owing 
to the presence of the DLA, the flux is expected to be zero at 
observed wavelengths below ~4100 A (e.g. below the Lyman 
limit of all DLAs). This is not the case in the very blue part 
of the spectrum (A \, s < 4000A) where the mean observed flux 
appears to increase (spuriously). 



These offsets were not applied to the standard stars. The current 
pipeline flux calibration does not take these fiber offsets into ac- 
count, therefore the spectrophotometry of the main QSO targets 
(e.g. not the ancillary targets) is biased toward bluer colors over 
the full wavelength range. Spectrophotometry of these objects 
will preferentially exhibit excess flux relative to the SDSS imag- 
ing data at A < 4000 A and a flux decrement at longer wave- 
lengths. Because the fiber offsets are intended to account for at- 
mospheric differential refraction, data will show larger offsets 
in spectrophotometric fluxes relative to imaging photome try for 
observations performed at higher airmass. Daw son et alj d2012l) 
discuss in details the quality of the BOSS spectrophotometry and 
reports that stellar contaminants in the quasar sample (i.e. quasar 
candidates that are actually stars) have g - r colors 0.038 mag- 
nitudes bluer than the photometry with an RMS dispersion of 
0.158 magnitudes. 

This problem is illustrated in Fig. [5] where the median com- 
posite spectra of quasars observed by both SDSS-I/II and BOSS 
are plotted together. The resulting SDSS-DR7 spectrum is in red 
and BOSS spectrum in black. The BOSS composite spectrum is 
bluer than the same composite from SDSS-DR7 spectra. Note 
that this flux mis-calibration is different from object to object so 
that Fig. [5] shows only the mean difference between DR7 and 
DR9 spectra. 

2.4.3. Identified quasars with bad spectra 

During the course of the first two years of BOSS, different ver- 
sions of the SDSS spectroscopic pipeline were used after some 
systematic problems had been fixed, thus improving the overall 
quality of the data. The visual inspection described below is per- 
formed on the fly, within a few days after the data are obtained, 
qualified and reduced by the version of the pipeline that is avail- 
able at the time the data are obtained. Once an object is positively 
identified as a quasar, a galaxy, or a star from visual inspection, 
it keeps its identification in our catalog unless an apparent mis- 
take has been committed and is corrected in the course of some 
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Fig. 5. Composite spectra of 6,459 quasars observed both by 
SDSS-DR7 and BOSS for (i) SDSS-DR7 spectra (red) and, (ii) 
SDSS-DR9 spectra (black). The slope of the two composite 
spectra should be similar (as any variability should be averaged 
out). This is not the case because of the difference in focus of 
the BOSS quasars and standard stars. Note that this flux mis- 
calibration is different from object to object. 

check performed afterwards by the scanners or by a user of the 
data. 

When a new version of the pipeline is made available, all 
the data are re-reduced. We then reinspect objects with uncer- 
tain identifications (QS0_?, QS0_Z?, Star_?, see Section 3.2) or 
spectra that are not qualified (Bad) but we do not reinspect the 
objects with firm identifications. 

It can happen that the spectra of a few objects are of lesser 
quality with the new version of the pipeline. These objects are 
still in the catalog. 

Even the most thorough work of the kind described here can- 
not be absolutely flawless. We encourage the reader to signal any 
mistake to the first author of this paper in order to ensure highest 
quality of the information provided in the catalog. 

3. Construction of the DR9Q catalog 

In order to optimally measure the BAO clustering signal in the 
IGM, we must have as pure a catalog of quasars as possible. 
In this catalog, peculiar features such as broad absorption lines 
(BAL) or Damped Lyman-or systems (DLA) that may dilute the 
signal, should be identified. We therefore designed quality con- 
trol of the data based on a visual inspection of the spectra of all 
BOSS objects that might be a quasar. During commissioning and 
the first year of the survey this quality control was also very use- 
ful to report problems with the pipeline, which helped improve 
the overall quality of the data reductions. 

The catalog lists all the visually confirmed quasars. About 
10% of these quasars have been observed several times (Dawson 
et al. 2012), either because a particular plate has been re- 
observed (e.g. to increase the SNR for a particular scientific 
project), or because a particular region in the sky has been re- 
observed at different epochs (e.g. Stripe 82), or, because plates 
overlap. Now, and throughout BOSS, overlapping plates are used 
as an opportunity to increase the SNR on a few objets (e.g. 
CORE objects). These repeat observations are often useful to 
confirm the nature of objects with low SNR spectra. However 
we did not attempt to co-add these data mostly because they are 
often of quite different SNR. 
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3.1. A tool for the visual inspection 

Immediately after the processing by the BOSS pipeline, the re- 
duced data (spectra and pipeline classification) are copied to the 
IN2P3 (Institut National de Physique Nucleaire et de Physique 
des Particules) computing centefl A Java program gathers meta- 
information and saves it into an Oracle database. 

All spectra are matched to target objects, imaging and pho- 
tometry information, and SDSS-II spectroscopy. They are pro- 
cessed by a Java program that computes basic statistics from the 
spectra and fits a power law continuum and individual emission 
lines to each spectrum. The spectra are then made available on- 
line through a collaborative web application, from which human 
scanners can flag objects and decide classifications. 

This tool and the visual inspection procedure described in the 
next Section evolved with time during commissioning and the 
first six months of the survey. The whole procedure was repeated 
at the end of the first two years to guarantee the homogeneity of 
the catalog. 



3.2. Visual inspection procedure 

The identifications provided by the BOSS pipeline are already 
very good. Nevertheless about 12% of all quasar targets have 
a non-zero ZWARNING flag, i.e. their redshift is not considered 
to be reliable by the pipeline. After visual inspection, 4% of all 
confirmed quasars have a non-zero ZWARNING flag. Not surpris- 
ingly, the fraction of these objects increases with magnitude (see 
Fig.©. 




r-PSF magnitude 

Fig. 6. Fraction of visually confirmed quasars with a non-zero 
ZWARNING flag as a function of the r-PSF magnitude (after cor- 
recting for Galactic extinction). A positive ZWARNING means that 
the pipeline considers its redshift estimate to be unreliable. This 
fraction increases at faint magnitudes. 



We visually inspected all quasar candidates and objects from 
quasar ancillary programs (see Section 2.2) to (i) secure the iden- 
tification of the object and, (ii) reliably estimate the systemic 



CC-IN2P3, http://cc.in2p3.fr 



redshift of the quasar. We manually confirmed or modified the 
identification of the object and, when needed, corrected the red- 
shift provided by the BOSS pipeline, i.e. when it was wrong 
(when e.g. an emission line is misidentified or a bad feature was 
considered an emission line) or inaccurate (when emission lines 
are correctly identified but not properly centered). Examples of 
misidentified objects or inaccurate redshift estimates are dis- 
played in Fig. [7] 

All the information on the objects is stored in a database 
which is updated in real time as new data arrives from the tele- 
scope. Modifications from the visual inspection are stored also in 
the database. For each plate, the objects classified by the pipeline 
as star, QSO with z < 2, and QSO with z > 2 are made available 
to the scanner in three different lists. The cut in redshift corre- 
sponds to the Lyman-cc emission line entering the BOSS spec- 
trum. It also corresponds to a strong gap in the BOSS quasar 
redshift distribution due to target selection (see Fig.l22l). 

Most of the objects classified as stars by the pipeline are in- 
deed stars and most of the objects classified as quasar with z < 2 
are either quasars with z < 2 or stars (see below). The objects 
classified as quasars at z > 2 are ranked by decreasing SNR. 
This organizes the visual inspection and minimizes the risk of 
errors. Most of the quasars with z > 2, the most valuable for the 
survey, are inspected by two different individuals. 

Objects that cannot be firmly identified by visual inspection 
are labeled in several categories. Some spectra cannot be rec- 
ognized because either the SNR is too low, or the spectrum has 
been badly extracted; such objects are classified as Bad. For oth- 
ers, the classification is not considered to be robust, but there is 
some indication that they are stars (star_?) or quasars (QSO_?). 
For some objects both scanners were unable to give a firm identi- 
fication, such objects are labeled as '?'. Other objects are galax- 
ies (Galaxy). Finally some objects are recognized as quasars but 
their redshifts are not certain (QSCLZ?). 

The output of the visual classification is provided as fields 
class.person and z_con£_person in the specObjAll table of 
the SDSS Catalog Archive Server (CAS) or the specObjAll.fits 
file from the Science Archive Server (SAS). The correspondence 
between the visual inspection classification we describe in this 
paper (QSO, QSO_BAL, QSO_Z?, QSO_?, Star, Star.?, Galaxy, ?) 
and the values of z_conf_person and class_person is given 
in Table Q] Each time a new version of the BOSS pipeline be- 
comes available, the data are reprocessed and objects in the cat- 
egories bad, ?, QS0_? and QSCLZ? are inspected again. Examples 
of objects classified as QSCLZ? and QSCL? are displayed in Fig. [8] 
Only objects classified as QSO or QS0J3AL are listed in the offi- 
cial DR9Q catalog. Objects classified as QSO_Z? are included in 
the supplemental list of quasars (see Section [8]). Objects classi- 
fied as QS0_? are also given for information in a separate list. 

Of the 180,268 visually inspected targets corresponding to 
the DR9Q catalog, 87,822 were classified as unique quasars, 
81,307 as stars and 6,120 as galaxies. 1,362 objects are likely 
quasars (QS0_?), 112 are quasars with an uncertain redshift 
(QS0_Z?) and 578 are likely stars (Star.?). 2,599 targets have 
bad spectra (Bad) while we were not able to identify 368 objects 
(?). Therefore 97.5% of the objects are successfully classified. 
Only 27 true quasars were mis-identified by the BOSS pipeline 
as Star, while 1 1,523 stars were classified as QSO, most of them 
misidentified, however, as low redshift quasars, and only 1,241 
have ZWARNING — 0. Table [2] gives a summary of these num- 
bers. 

Note that Palanque -Delabrouille et all d2012l) have obtained 
deeper MMT QSO data of some of the BOSS targets classified 
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Fig. 7. First column: Examples of z > 2 quasars classified as STAR by the BOSS pipeline. The overall shape of the spectrum is 
similar to the spectrum of F stars. Second column: Examples of stars identified as QSO by the BOSS pipeline. Strong absorption 
lines or wiggles in the spectrum can mimic quasar features. Third colum: Examples of z > 2 quasars for which the BOSS pipeline 
provides an inaccurate redshift estimate that must be corrected during the visual inspection. The pipeline is confused by the strong 
absorption lines. The spectra were boxcar median smoothed over 5 pixels. 
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Fig. 8. Examples of QS0_? (top panels) and QS0_Z? (lower pan- 
els). The spectra were boxcar median smoothed over 5 pixels. 



as QS0_? and confirmed that essentially all of these objects are 
true quasars. 

During the visual inspection, a redshift is determined that 
will be refined further by an automatic procedure (see SectionH]). 
The redshift of identified quasars provided by the visual inspec- 
tion is obtained applying the following procedure: 



z_conf_person 
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class.person 
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Star.? 


Star 


3 




QSCL? 


QSCLZ? 


QSO 


4 








Galaxy 


30 








QSCLBAL 



Table 1. The table gives the classification from the visual in- 
spection corresponding to the combination of class_person 
(first column) and z_conf .person (first row) values provided in 
the headers of the SDSS-DR9 spectra available from the SDSS 
Catalog Archive Server. 



The first guess for the redshift is given by the BOSS pipeline 
and is not modified except if inaccurate or wrong. The red- 
shift from the pipeline can be wrong in cases where an emis- 
sion line is misidentified. The presence of strong absorption 
at or near the emission, and especially a strong DLA, is also 
a source of error. Often the redshift is just inaccurate be- 
cause either it misses the peak of the Mg n emission line (and 
we consider that this line is the most robust indicator of the 
redshift) or it is defined by the maximum of the C iv emis- 
sion line when we k now that this l i ne is often b lueshifted 
compared to Mg n fcaskelll [l982t iMcIntosh et alj Il999t 
I Vanden Berk et all l200lt iRichards et alJ l2002t IShen et alJ 
2008: lHewett & WikftOiaT " 

If the Mg ii emission line is present in the spectrum, clearly 
detected, and not affected by sky subtraction, the visual in- 
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spection redshift is set by eye at the maximum of this line. 
The typical uncertainty is estimated to be Az < 0.003. The 
redshift is refined further, as described below. 
- In other cases and for z > 2.3 quasars, such that Mg n is 
redshifted into the noisy part of the red spectrum where sky 
subtraction errors make it unreliable, the redshift is estimated 
using the positions of the red wing of the C iv emission line 
which is known to be often blueshifted compared to Mg n 
and of the peak of the Lyman emission line. The precision is 
estimated to be Az < 0.005. 

The visual redshift is not accurate to better than Az ~ 0.003. 
but can be used as a reliable guess for further automatic redshift 
determination (see Section©. Fig.|9]displays the distribution of 
the velocity difference between the visual inspection redshift es- 
timate and the redshift provided by the BOSS pipeline. At z < 2 
the pipeline estimate is usually good and does not require sig- 
nificant adjustments. In the redshift range 2.0-2.3, about half 
of the redshifts are modified because the Mg n emission line is 
available and defines clearly the visual inspection redshift while 
the pipeline finds often a slightly lower redshift. At z > 2.3, 
10% of the redshifts are corrected. Only 1,116 quasars (~2%), 
regardless of ZWARNING flags, have a difference between the 
pipeline and visual redshifts larger than 0. 1 . 
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Fig. 9. Normalized (to unit integral) distribution of the velocity 
difference between the pipeline and visual inspection redshift 
estimates for different redshift bins. About half of the pipeline 
redshifts are corrected during the visual inspection. Most of the 
corrections are for quasars with 2 < z < 2.5 where the Mg n 
emission line is available and where the pipeline redshift esti- 
mate does not correspond to the peak of the Mg n emission line. 



tion are flagged as well whatever the identification of the 
object is. These quality flags are pipeline-version dependent 
and are not meant to be released with the catalog. They are 
mainly useful for feedback to the pipeline team. 

3.3. A note on Damped Lyman-a systems 

In the course of the visual inspection, we flag the spectra with 
strong H i absorption (DLAs) in the Lyman-o' forest. At this 
point we do not try to measure the column density or to deter- 
mine the redshift of the DLA. Flagging these lines of sight can be 
useful to complement the search for DLAs by automatic proce- 
dures since this is a notoriously difficult task. Fig. [TUl shows the 
number of DLAs we flag along SDSS-DR7 lines of sight reob- 
served by BOSS, versus the A^(H i) column density. It can be seen 
that we visually recover most of the DLAs (log N(H i) > 20.3) 
identified in the SDSS-DR7 by Noterdaeme et al. (2009). Only 
11 such DLAs are missed by the visual inspection out of 257. 
The detection and analysis of DLAs in BOSS spectra is beyond 
the sc ope of this paper and will be described in lNoterdaeme et aTl 
d20ll . 
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Fig. 10. H i colu mn density distribution for DLAs and sub- 
DLAs detected by Noterdae me et al.1 J2009) in quasars observed 
both by SDSS-DR7 and BOSS (black histogram). The red his- 
togram displays the same distribution but for DLAs flagged after 
visual inspection of BOSS spectra. This shows that the visual in- 
spection is robust for log N(H i) > 20.3, the standard definition 
of DLAs. 



In addition, peculiar spectral features are flagged: 

- When a damped Lyman-a absorption line is present in the 
forest, the object is assigned a flag "DLA". This flag can be 
used to check automatic Dam ped Lyman-Q' detections (see 
iNoterdaeme et1dl2009l 120121 

- Broad absorption lines in C iv and/or Mg n are also flagged. 
At this point there is no estimate of the width of the lines 
and we stay conservative. This flag can be used to check au- 
tomatic BAL detections (see Section lBTTT i. 

- Problems such as the presence of artificial breaks in the spec- 
trum, obviously wrong flux calibration, or bad sky subtrac- 



4. Automatic redshift estimate 

The visual inspection provides a reliable and secure redshift es- 
timate for each quasar. Nevertheless, it is somewhat subjective 
and the accuracy of such an estimate is limited and cannot be 
better than 500 km s _1 . In principle, it is possible to estimate 
the redshift of a quasar using a linear combination of principal 
components to fit the spectrum: the well known systematic shifts 
between emission lines are intrinsically imprinted in the compo- 
nents and the method can take into account the variations from 
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Classification # pipeline # pipeline # visual inspection 
with ZWARHIHG=0 

QSO 102,696 86,855 87,822 = 

QS0withz> 2 (69,975) (64,004) (65,185) 

QS0_? - - 1,362 

QS0_Z? - - 112 

Galaxy 10,563 6,812 6,120 

Star 67,009 49,475 81,307 

Star.? - - 578 

Bad - - 2,599 

? - - 368 

Total 180,268 143,142 180,268 



Table 2. Number of objects identified as such by the pipeline with any ZWARNING value (second column) and with ZWARNING = 
(third column), and after the visual inspection (fourth column). 



quasar to quasar (see e.g. lParis et al.ll20TlT) . This should be a re- 
liable procedure providing: 

- The reference sample used to derive the principal compo- 
nents is representative of the whole quasar population; 

- The redshift of each quasar in the reference sample is reli- 
able. 

We will derive PCA components in order to reproduce the quasar 
spectrum between 1,410 and 2,900 A, in the quasar rest frame, 
so that most of the prominent emission lines are covered, espe- 
cially C iv and Mg n. This will yield an automatic estimate of the 
quasar redshifts. These components will be also used to fit emis- 
sion lines individually to estimate a redshift for each emission 
line from the peak of the fit model. To derive these PCA com- 
ponents, we will use a reference sample of quasars for which 
the two main emission lines are well observed. The redshifts of 
the quasars in the reference sample have also to be chosen care- 
fully. The technique to derive PCA components of quasar spectra 
has been described in detail in several pa pers (e.g. lFrancis et al.l 
U992tlYip et al.ll2004l:ISuzuki et aUl2005h . We refer the reader in 
particular to Section 2.3 of Paris et al. (201 1). 

4. 1 . Selection of the reference sample 

To compute a set of principal components from a sample as rep- 
resentative as possible of the whole quasar population, we se- 
lected quasar spectra in SDSS-DR7 meeting the following re- 
quirements: 

- The rest frame wavelength range 1,410 - 2,900 A is red- 
shifted into the observed wavelength range 3,900 - 9,100 A 
(i.e. 1.77 < z < 2.13). This observed wavelength range is 
chosen to avoid the flux-excess iss ue in the very b lue portion 
of the spectra (Section 2.4.1 and Pari s et al 1 1201 ll) and bad 
sky line subtraction at the red end. 

- The median squared SNR per pixel over the full wavelength 
range is higher than 5. 

- The spectra do n ot display BAL troughs as listed in the 
lAllen et al.l d201 lb catalog. 

In SDSS-DR7, 8,986 quasar spectra meet these requirements. 
They all were visually inspected to remove spectra with obvious 
reduction issues (missing pixels, continuum breaks or very bad 
flux calibration). We finally used 8,632 spectra. 

The low-SNR cut we use here maximizes the number of 
quasars used for the PCA decomposition and makes our sample 
as representative as possible of the BOSS quasars. 



4.2. Computing principal components 

We now need an accurate redshift for each quasar before we 
calculate the PCA eigenvectors. We first describe the use of 
Hewett & Wild (2010) redshifts and then an improved approach 
using the peak of the Mg n emission line in individual spectra. 

Using Hewett & Wild (2010) redshifts: 

We first consider the redshifts provided by Hewett & Wild 
(2010; HW10). They have performed a systematic investigation 
of the relationship between different redshift estimation schemes 
and have derived empirical relationships between redshifts based 
on different emission lines. They generated a high-SNR quasar 
template covering the UV and optical bands to be used to cal- 
culate cross-correlation redshifts. They estimate and correct for 
the quasar luminosity-dependence of systematic shifts between 
quasar emission lines. They are thus able to reduce systematic 
effects dramatically, correcting redshifts for the mean system- 
atic shifts between emission lines. Note however that this does 
not fully account for intrinsic quasar-to-quasar variation among 
the population. 

Using these redshifts for the sample of representative quasars 
defined in Section l4~Tl we derive the PCA eigenvectors. We then 
use the set of principal components to fit a linear combination of 
4 principal components to the whole spectrum of z > 2.2 SDSS- 
DR7 quasar spectra and estimate their redshifts. This number 
of components has been chosen after several trials in order to be 
able to derive a robust redshift for the maximum of objects. Note 
that the samples used to compute the principal components and 
to which we apply the procedure are disjoint. 

The median of the distribution of the velocity differences 
between the redshift given by HW10 and our redshift estimate 
is less than 30 km s . However, the rms of this distribution is 
about 1,200 km s _1 which is undesirably large and is presum- 
ably due to quasar to quasar variations in emission-line shifts. 

We can try to overcome this drawback by using a redshift 
that is more representative of the individual characteristics of 
the quasars in the reference sample. This is why we will derive 
a redshift from the observed Mg n emission line in each quasar 
spectrum. Indeed this line has been recognized as a reliable 
indicator of the actual redshift of the quasar (Shen et al. 2007, 
HW10). 

Using Mg ii emission line redshifts: 

Using the set of PCA components previously described, we 
fit the Mg ii emission line of each quasar in the same SDSS-DR7 
reference sample. From this fit, we define the Mg n redshift us- 
ing the peak-flux position of the emission line fit. Using a com- 
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bination of principal components to fit an emission line avoids 
the need to assume a line profile (e.g. Gaussian, Lorentzian or 
Voigt). 

To estimate the quality of each emission line fit: 

- We compute the amplitude of the emission line (expressed in 
units of the median error pixel of the spectrum in the window 
we use to fit the line) from the maximum flux relative to a 
fitted power-law continuum. 

- We measure the FWHM of the emission line in km s _I . 

The amplitude-to-FWHM ratio (expressed in s km 1 ) provides 
an estimate of the prominence of the emission line. In particular, 
a weak and broad emission line will display a very low value of 
the amplitude-to-FWHM ratio. 

To confirm the quality of the Mg n line measurement, we 
also fit C iv emission lines using the same procedure. The C iv 
emission line is easier to fit since it is stronger and the region of 
the spectrum where it is redshifted is cleaner. If C iv could not 
be fit, we also considered the Mg n fit to be unreliable. 

We then used the 7,193 spectra with both C iv and Mg n 
amplitude-to-FWHM ratios larger than 8 x 10~ 4 s krrr 1 to com- 
pute the new PCA components to be applied to the whole spec- 
tra. 

We use the set of principal components derived with the 
Mg ii redshifts in the following. Fig.QT]displays me mean spec- 
trum together with the first five principal components. 
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Fig. 11. Mean spectrum and the first five principal components 
derived in Section 14.21 A linear combination of the first four 
principal components is used to estimate the global redshift of 
the quasar, while five components are used to fit emission lines 
locally. 



4.3. Redshift estimates for BOSS quasars 

For each quasar in the DR9Q catalog, we use four principal com- 
ponents to fit the overall spectrum after having subtracted the 
mean spectrum. Four components are enough to repr oduce the 
overall shape of the sp ectrum and derive the redshift dYip et al.l 
2004: Ipjris et al.ll201 ll) . However, in order to avoid poor fitting 



due to the presence of strong absorption lines and especially 
BAL troughs, we first obtain a fit with only two principal com- 
ponents. This number is chosen because it provides a reason- 
able estimate of the amplitude of emission lines. Using this first 
guess, we remove pixels below 2cr and above 3<x of the contin- 
uum where cr is defined as the median flux error in an 11 pixel 
window. We are thus able to remove broad absorption lines and 
badly subtracted sky emission lines (especially at the very red 
end of the spectra). We then increase the number of principal 
components iteratively to three and four, removing narrow ab- 
sorption lines, keeping the same detection thresholds. 

Then, taking the visual inspection redshift estimate as an ini- 
tial guess, it is possible to determine a redshift for each quasar 
by fitting a linear combination of four principal components to 
the spectrum, in which the redshift becomes a free parameter. 
We call this redshift the PCA redshift. 

In addition, and in the same way as described in the previous 
subsection, we used five principal components to fit the Mg n 
emission line in BOSS spectra when possible and derived a red- 
shift from the peak flux of the fit model. Using PCA allows to 
recover the line without a priori assumptions about the line pro- 
file in a region of the spectrum affected by sky subtraction. In 
the following we will call this redshift the PCA Mg n redshift 
estimate. 

We compare in Fig.[T2]the distributions of the velocity differ- 
ence between PCA and PCA Mg n redshift estimates. The PCA 
was applied to all BOSS quasars with 1.57 < z v isuai < 2.3 so that 
both C iv and Mg n emission lines are in the observed redshift 
range and are not strongly affected by sky subtraction. We con- 
sidered three PCA estimates, varying the rest frame wavelength 
range over which the PCA was applied : (i) 1,410 - 2,850 A (full 
range); (ii) 1,410 - 2,500 A (Mg n is not included) and (iii) 
1410 - 1800 A (only C iv is in the range). There are 18,271 ob- 
jects. It can be seen in Fig.[T2]that the distributions are very sim- 
ilar. The median and rms of the distributions are (-35.3, 642), 
(-52.2, 780) and (-30.3, 851) km s" 1 respectively for the three 
wavelength ranges. The rms is dominated by low SNR spec- 
tra and slightly increases when the amount of information de- 
creases. The similarity of the distributions clearly shows that the 
PCA redshift estimate is consistent with the Mg n estimate even 
when Mg n is not included in the fit. 

4.4. Comparison to HW10 

In order to compare the HW10 redshift estimates to ours, we 
selected SDSS-DR7 quasars re-observed by BOSS in the red- 
shift range 2.00 < z < 2.30. We also restricted the sub-sample to 
quasars for which we were able to fit the Mg n emission line reli- 
ably and required the amplitude-to-FWHM of this line be larger 
than 8 x 10~ 4 s km -1 . Even though the Mg n emission line is 
still detectable up to z = 2.5, we restrict the redshift range to 
below z - 2.3 to avoid the red end of the spectra where sky 
lines can be badly subtracted. 746 quasar spectra remain for the 
comparison. 

Fig. [13]displays the distributions of velocity differences be- 
tween the PCA Mg ii reds hift esti mate and our PCA global esti- 
mate (black histogram) or IH W 1 Ol redshift (red histogram). Both 
distributions were normalized in the same manner and we also 
took into account the difference in the rest frame wavelength 
used by the different authors. 

The median IHWTOI redshift estimate is shifted by +136.9 
km s _I (with positive velocity indicating redshift) compared to 
our median Mg n redshift with an rms of 467 kms -1 . Both 
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Fig. 12. Distributions of the velocity difference between PCA 
redshift estimates derived using different rest frame wavelength 
ranges and the PCA Mg n redshift estimate. 



HW10 and Shen et al. (2011) find that the median shift of the 
Mg ii emission line relative to the [OIII] doublet is smaller than 
30 km s _1 . These discrepancies in the median velocity shift may 
not be very significant as different fitting recipes for any of these 
lines (Mg n, [OIII], C iv) can potentially cause systematic veloc- 
ity differences of this order. 

The median shift of our global estimate compared to the 
Mg ii redshift is -49.9 km s -1 , with an rms of 389 km s . The 
rms of the distribution is smaller than previously because we re- 
strict our comparison here to spectra with high SNR. It is more 
peaked and the number of outliers is lower. This is not surpris- 
ing as we use the same components to fit the overall spectrum. 
However this illustrates the intrinsic dispersion between the re- 
sults from the two methods. 

The overall conclusion is that our PCA estimate is very close 
to the Mg ii emission line redshift. And we are confident that the 
application of the procedure using PCA components to quasars 
for which the Mg n emission line is redshifted beyond the ob- 
served wavelength range, will give robust redshift estimates. 

4.5. Emission line redshifts 

Following the procedure described above, it is possible to re- 
produce the shape of each emission line with a linear combina- 
tion of principal components. This combination can therefore fit 
the individual lines without any a priori assumption about the 
line profile. In the case of individual lines we have more flex- 
ibility to use more components because the fit is more stable 
over a smaller wavelength range. We will use five PCA compo- 
nents and define the position (redshift) of the line as the position 
of the maximum of this fit. Table [3] displays the definition of 
each window used to fit emission lines together with the vacuum 
rest frame wavelengths taken from the NIST databas^] used to 
compute the redshift. For multiplets (e.g. C iv and Mg n), the 
rest frame wavelength used is the average wavelength over the 
transitions in the multiplet weighted by the oscillator strengths. 
Together with the redshift estimate of each line, we also retrieve 
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Fig. 13. Normalized distributions of the velocity difference be- 
tween our global PCA redshift estimate (black histogram), the 
pipeline redshift estimate (blue histogram) or iHewett & Wild! 
(2010) redshifts (red histogram) with the redshift derived from a 
PCA fit of the Mg n emission line (see text). 



Transition 


Window 


Rest frame wavelength (A) 


Civ 


1450-1700 


1549.061 


Cm] 


1800-2000 


1908.734 


Mg ii 


2600-2850 


2798.778 



http://physics.nist.gov/PhysRefData/ASD/lines_form.html 



Table 3. Window and rest frame wavelength used to fit each 
emission line. 



information on the symmetry of the line. We compute the blue 
(red) HWHM (half width at half maximum) from the PCA fit, 
bluewards (redwards) of its maximum. The total FWHM is the 
sum of the blue and the red HWHMs. The continuum is provided 
by the fit of a power law over the rest frame wavelength windows 
1450 - 1500, 1700 - 1850 and 1950 - 2750 A. 

In Fig.[14]we plot the velocity of C iv relative to Mg n versus 
the absolute magnitude of the quasar. The more luminous the 
quasar, the more blueshifted is the C iv emission line. Errors in 
the fit are less than 200 km _1 . These measurements can be useful 
to understand the relative shifts between different emi ssion lines 
and discuss the structure of the broad line region (see Shen et al. 
120071 IShang eTaIll2007l) . 

The C m]/ll909 fine is blended with Si m]A1892 and to a 
lesser extent with Al ni/11857. We do not attempt to deblend 
these lines. This means that the redshift and red HWHM de- 
rived for this blend should correspond to C m]/U909, but the 
blue HWHM is obviously affected by the blend. 

5. Broad absorption line quasars 

Broad absorption troughs are flagged as BAL during the visual 
inspection. This flag means that an absorption feature broader 
than a usual intervening absorption (those arising in galaxies ly- 
ing along the line of sight to the quasar) is seen. These BALs 
may affect the Lyman-a' forest and should be removed from its 
analysis. We flag mostly C iv BALs but also Mg n BALs. Since 
during the visual inspection we do not measure the width of the 
trough, there is no a priori limit on the strength of the absorption. 
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Fig. 14. Velocity difference between C iv and Mg n emission line 
redshifts as a function of the absolute i magnitude of the quasar. 
The solid black line shows the median velocity shift in 0.2 mag 
bins. Blue and cyan histograms display the 10 th , 30 th , 70 th and 
90 th percentiles. The mean shift between the two emission lines 
increases with the quasar luminosity. 



We also implemented an automatic detection of C iv BALs. 
We describe in Section lBTTl the method used to detect BALs and 
estimate their properties automatically. We then test the robust- 
ness of the visual inspection in Section l5T2l and the results of the 
automatic detection in Section [53l 

In the following subsections, we will concentrate on C iv 
BALs with z > 1.57. The quasar redshift limit is chosen so that 
the Si iv emission line is included in the spectra. This ensures 
that C iv BALs can be measured across the full range of veloci- 
ties in balnicity index, e.g. up to 25,000 km s _1 . 

5.1. Method used to estimate BAL properties automatically 

In order to detect BALs and to characterize the strength of the 
troughs using an objective pr ocedure, we compute the balnic- 
ity (BI. |Wevmann et al.|[l99ll) and the absorption indices (AI, 
lHall et al.ll2002h of the C iv troughs. In addition, we introduce a 
new index, the detection index, DI, which is a slight modifica- 
tion of BI. In Section IB~3l we will measure these indices for all 
quasars regardless of visual inspection. 

The continuum has to be estimated first. For this, we use the 
same linear combination of four principal components described 
in Section [4] The resulting continuum covers the region from 
the Si iv to the Mg n emission lines (see examples in Fig. fT3T >. 
As described in Section 4.5, the procedure iteratively avoids ab- 
sorption features and especially the BALs. During the automatic 
procedure, we smoothed the data with a five pixel boxcar me- 
dian. 

With this continuum, we compute the balnicity index (BI) in 
th e blue of the C iv emiss ion line using the definition introduced 
bv lWevmann et alj d!991l) : 



BI 



y-3,000 

J25.000 



1 - 



/(v) 
0.9 



C(v)dv, 



(1) 



where f(v) is the flux normalized to the continuum as a function 
of velocity displacement from the line center. C(v) is initially set 
to and can take only two discrete values, or 1 . It is set to 1 
whenever the quantity 1 -/(v)/0.9 is continuously positive over 
an interval of at least 2,000 km/s. It is reset to zero whenever the 
quantity in brackets becomes negative. Therefore BI = does 
not mean that no trough is present. It means that, if a trough is 
present, the absorption does not reach 0.9 times the estimated 
continuum over a continuous window of 2,000 km s _1 . 

We will also define a detection index, DI, giving C a value 
1 over the whole trough if the criterion of a continuous trough 
over 2000 km s _1 is fulfilled. This index has the advantage of 
measuring the strength over the whole trough. This index will 
be useful to apply cuts in the analyses of the Lyman-a' forest. 
Indeed these analyses need an estimate of the total strength of 
the trough in order to avoid lines of sight spoiled by a strong 
BAL. 

To study weaker troughs. Hall et al.l(l2002l) introduced the AI 
measurement defined as 



AI 



J 25 ,000 



1 - 



/(V) 

0.9 



C(v) dv, 



(2) 



where /(v) is the normalized flux and C(v) has the same defini- 
tion as for the DI except that the threshold to set C to 1 is reduced 
to 450 km s _1 . The AI index was introduced in order to take into 
account weaker troughs and to measure troughs that are located 
close to the quasar rest velocity. It is however more sensitive 
to the continuum placement than the BI. Note that Trump et al. 
(2006) used a modified version of the AI wherein the factor of 
0.9 was removed from the integral to make the AI an equivalent 
width measured in km s _1 , where 1,000 km s _1 was the thresh- 
old instead of 450 km s _1 , and where the integral extended to 
29,000 km s _1 . In this work we use the original Hall et al. (2002) 
definition of the AI. 

Following Trump et al. (2006), we calculate the reduced x 1 
for each trough: 



trougi 



l-/(v) 



(3) 



where N is the number of pixels in the trough, f(v) is the nor- 
malized flux and <x the rms of the pixel noise. The greater the 
value of^ ough , the more likely the trough is not due to noise. 

We apply the automatic detection to all quasars in the DR9Q 
catalog and provide values of DI, AI and BI. We estimate also an 
error on the indexes. The error squared is obtained by applying 
the same formula as for the indexes replacing (1 - f/0.9) by 
(cr/0.9) 2 with cr the rms of the noise in each pixel. Note however 
that the error on the strength of the trough is most of the time 
dominated by the placement of the continuum. To estimate the 
latter we have displac ed the fitted continuum by 5% and applied 
Ea.(2) of lKaspi et alJ d2002h . 



5.2. Robustness of the visual detection of BALs 

During the visual inspection, we are conservative and flag a BAL 
only if the trough is apparent. In addition, the automatic detec- 
tions rely on the position of the continuum while the visual in- 
spection lacks this problem. This means that the BAL sample 
from the visual inspection is purer than those from automatic 
detection. It is however unavoidable that, as the strength of the 
absorption or the spectrum SNR decreases, the visual inspec- 
tion will start to be subjective. On the other hand the fraction of 
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Fig. 15. Examples of high-redshift BAL quasar spectra in different ranges of signal-to-noise ratio and balnicity indices. The fit of 
the continuum is overplotted. 



false BALs detected by the automatic procedure will be higher. 
Fig. [16] shows the ratio of the number of visual BALs to that 
of the automatic detections as a function of SNR per pixel at 
/L-gt = 1700 A for BI > 500 km s" 1 . 
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Fig. 16. Ratio of the numbers of BAL visual and auto- 
matic detections as a function of spectral SNR per pixel at 
,l rest = 1,700 A for BI > 500 km s _1 . As expected, this ratio 
decreases with decreasing SNR. 



Out of the whole DR9Q catalog, 7,533 quasars have been 
flagged visually as BAL. Out of the 69,674 quasars with z > 
1.57, 7,228 are flagged as BAL by visual inspection. If we re- 



strict the latter sample to quasars with SNR>10 at 1,700 A in the 
rest frame we have flagged 1,408 BALs out of 7,317 quasars, a 
fraction of 19.2% w hich compares well with what was found by 
iGibson et alj d2009l). 

iTrump et all d2006l) measured BAL troughs (BI and AI) in 
the SDSS-DR3 release. We compare their detections and BI 
measurements with ours for quasars in common between BOSS 
an d SDSS-DR3 Out o f the 477 BALs (BI > 0) that are detected 
bv lTrumpetal](l2006l) . we flag 425. We checked the BOSS spec- 
tra of these quasars individually. About half of them are not 
BALs and a handful, all with BI < 500 km s _1 , are real BALs 
that were missed by the visual inspection. For the rest, it is hard 
to decide if they are real or not because of poor SNR. Note that, 
in general, BOSS spectra are of higher SNR than previous SDSS 
spectra. 

There are an additional 296 quasars in Tru mp et"al] (|2006) 
that have C iv troughs that we do not flag as BALs. These all 
have AI > but BI - 0. The histogram of AI measurements from 
ITrump et al.l ((2006) for these objects is plotted as well in Fig. [T7] 
(black histogram). Most of the missing troughs have AI smaller 
than 1,000 km s . A visual inspection of the BOSS spectra re- 
veals that most of the AIs have been overestimated and about 
half are not real mainly because the continuum in the red side of 
the C iv emission line has been overestimated. 

lAllen et all (120111) searched for BALs in SDSS-DR6; they 
measured only BI. Out of the 7,223 quasars with z > 1.57 
in common with BOSS, they find 722 quasars with BI > 0. 
Of these 7,223 objects, we flag 1,259 as BALs of which 853 
have BI>0. We checked the 1 31 objects for which we measured 
BI > but lAllen etall d2011l) found BI = individually. Some 
of the additional BALs are iden tified because of better SNR 
in BOSS, some were missed bv lAllen et al.l (1201 ll) because of 
difficulties in fitting the emission line correctly, a handful are 
explained by the disappearance of the BAL between the two 
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Fig. 17. Distribution of BI and AI for quasars detected as BAL 
by automatic procedures in previous SDSS releases, and that 
were not flagged by the visual inspection of the BOSS spectra. 
The black hi s togram shows the distribution of AI as measured by 
iTrump et al.l d2006) for 296 such quasars (all have BI = 0). About 
half of the m are not re a l BAL s (see text). From the automatic de- 
tection by [Allen et al.l (1201 ll) (blue histogram), 57 quasars were 
missed by the visual inspection. Here again, only a handful of 
these objects actually display BAL troughs. 



epochs dFiliz A k et al. 2012) and also by some appea rances (see 
Fig.[18T >. We also find 57 objects that are detected by Alle n et al.l 
(201lFand are missing in our visual detection. The BI distribu- 
tion of these objects is shown as the blue histogram in Fig. [17] 
About half of them are not BALs upon re-inspection and a hand- 
ful are real BALs missed by the visual inspection. The nature of 
the rest of the objects is unclear. 




Wavelength (A) 

Fig. 18. Example of appearing BAL troughs. This quasar has 
been observed in SDSS-DR7 (red curve) and in SDSS-DR9 
(black curve). The two spectra have been scaled to have a sur- 
face unity between 5,600 and 6,200A. The normalized flux in the 
C iv region expressed in velocity is displayed in the inset. This 
quasar had was not detected as BAL in SDSS-DR7 (i.e. BI = 0) 
while it has BI = 1,826 km s _I in the SDSS-DR9 spectrum. 



We conclude from this comparison that our catalog of BAL 
quasars flagged by visual inspection is pure at the 95% level, but 
is probably incomplete below BI~500 km s" 1 . This results from 
the conservative approach we adopted when flagging the troughs 
implying that the number of detections in the visual inspection 
is decreasing with decreasing SNR. 

It is difficult to estimate the incompleteness especially at 
low SNR because none of the previously published samples is 
reliable at small BI values. Therefore we caution the reader 
against blind uses of the catalog. SNR at rest wavelength 
1700 A (SNR_1700) is provided in the catalog. This can be used 
to identify the more reliable spectra. 

The BI distributions normalized by the total number of 
quasars with BI>500 km s in each sample for visual ly flagged 
BALs in DR9 (this work) and DR6 (lAllen et al]l201ll) are com- 
pared in Fig. Q2] We find 3,130 BALs with BI>500 km s _1 out 
of 69,674 BOSS quasars with z > 1.57 (4.5 %). If we restrict 
ourselves to quasars with SNR > 10, these numbers are 813 
BALs out of 7,317 quasars, corresponding to a rate of 11.1%. 
This compares well with the ~1Q% uncorrected observed frac- 
tion of BAL found by Allen et al. (201 1) at z ~ 2.5. 
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Fig. 19. Normalized distributions of the logarithm of balnicity 
indices measured from C iv troughs. The BI distribution from 
the present catalog (black histogram) computed from 7,227 visu- 
ally flagged BAL quasars is very similar to the distribution from 
Trump et al. (2006, red histogram) obtained f rom 1,102 BAL 
quasa rs from the SDSS-DR3 quasar catalog (Schneide r et al.l 
2005). The distribution is also very similar to the BI distribution 
from Allen et al. (201 1, blue histogram) based on the SDSS-DR6 
quasar catalog. 



5.3. Automatic detection 

We also performed an automatic detection of the C iv troughs 
using the continua in the wavelength range between the Si rv to 
C iv emission lines computed as described in Section|4] Bis, AIs 
and DIs are calculated for all quasars with z > 1 .57 using Eq. [1] 
and Eq.|2] The values are given in the catalog together with the 
number of troughs both with width > 2000 and 450 km s _1 . We 
also give, for quasars with BI > 0, the minimum and maximum 
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velocities relative to z em , v m ; n and v max , spanned by the whole 
absorption flow. 

Out of the 69,674 (resp. 7,317) quasars with z > 1.57 (resp. 
and SNR_1700>10), 8,124 (resp. 3,499) BALs, with^ h > 10, 

have AI > km s _1 . A visual inspection of spectra with small 
values of AI indicates that a number are due to inadequate con- 
tinuum fitting. We advise to be careful with AI values smaller 
than 300 km s _1 (see also below). 

Out of the 69,674 (resp. 7,317) quasars with z > 1.57 (resp. 
and SNR_1700>10), 4855 (resp. 1,196) BALs have BI > 
kms -1 . This corresponds to 7% (resp. 16.3%). 821 BALs 
(11.2%) have BI>500 km s" 1 . While the overall detection rate 
is larger than for the visual inspection, it is important to note 
that the automatic detection finds only 8 more objects with BI> 
500 km s in spectra with SNR>10 than the visual inspection. 
Upon reinspection, we found that half of them are not real and 
are due either to a peculiar continuum or to the presence of 
strong metal lines from a DLA at z- d bs ~ z em - Three are real, but 
shallow BALs. This shows that the automatic and visual detec- 
tions give nearly identical results for BI>500 km s . At lower 
BI and lower SNR, and consistently with what was found by 
comparison with previous surveys, the number of unreliable de- 
tections is large. 
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to +4 
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Fig. 20. Balnicity index (B I) from this work against BI mea- 
sured bv lAllenetai] (1201 ll) for SDSS-DR6 objects re-observed 
by BOSS. 



We compare in Fig. [20] the BI values measured by Allen 
et al. (2011) for SDSS-DR6 spectra with BI values measured 
by our automatic procedure using BOSS spectra for the same 
quasars. Although the scatter is large, the median difference is 
only ~30 km s . Note that part of the scatter is probably due 
to BA L variability (see iGibson et al.ll2008l l2010t iFiliz Ak et all 
120121) . In Fig. [2TJ left (resp. right) panel, we compare the fre- 
quency distribution of AI (resp. BI) values in our BAL sample 
detected automatically with that of previous studies. The distri- 
butions are normalized in the same manner. It can be seen that 
the shape of the distributions are very similar. They peak around 
AI = 300 km s _1 which is the lower limit we set for robust de- 
tection. 



We have shown here that BI measurements provided in the 
catalog are robust for SNR > 5 (see Fig.lTSb and BI > 500 km s . 
Any statistical analysis should be restricted to the correspond- 
ing sample. The catalog gives a few properties of detected C iv 
troughs and of Si iv and Al in troughs but only in cases where 
BI(C iv) > 500 km s _1 and SNR>5. These troughs have been 
measured by Gibson et al. (2009) in SDSS-DR5. 

6. Description of the DR9Q catalog 

The DR9Q catalog is available both as a standard ASCII 
file and a binary FITS table file at the SDSS public website 
http://www.sdss3.org/dr9/algorithms/qsojcatalog.php The files 
contain the same number of columns, the FITS headers con- 
tain all of the required documentation (format, name, unit of 
each column). The following description applies to the standard 
ASCII file. Table |4] provides a summary of the information con- 
tained in each of the columns in the ASCII catalog. The supple- 
mental list of quasars (see Section 8) together with the list of 
objects classified as QS0_? are also available at the same SDSS 
public website. 

Notes on the catalog columns: 

I. The DR9 object designation, given by the format 

SDSS Jhhmmss.ss+ddmmss.s; only the final 18 characters are 
listed in the catalog (i.e., the "SDSS J" for each entry is 
dropped). The coordinates in the object name follow IAU con- 
vention and are truncated, not rounded. 

2-3. The J2000 coordinates (Right Ascension and D eclination) 
in dec imal degrees. The astrometry is from DR9 (see lAhn et alj 
I20T2I) . 

4. The 64-bit integer that uniquely describes the spectroscopic 
observation that is listed in the catalog (Thing JD). 
5-7. Information about the spectroscopic observation 
(Spectroscopic plate number, Modified Julian Date, and 
spectroscopic fiber number) used to determine the characteris- 
tics of the spectrum. These three numbers are unique for each 
spectrum, and can be used to retrieve the digital spectra from 
the public SDSS database. 

8. Redshift from the visual inspection (see Section [3~2l . 

9. Redshift from the BOSS pipeline (see Section [2] and 
iBolton et al.ll2012h . 

10. Error on the BOSS pipeline redshift estimate. 

II. ZWARNING flag from the pipeline. ZWARNING > indi- 
cates bad fits in the redshift-fitting code. 

12. Automatic redshift estimate from the fit of the quasar contin- 
uum over the rest frame wavelength range 1 ,4 1 0-2,000 A with a 
linear combination of four principal components (see Section©. 
When the velocity difference between automatic PCA and vi- 
sual inspection redshift estimates is larger than 3000 km s _1 , this 
PCA redshift is set to - 1 . The inaccuracy in the PCA estimate is 
often due to difficulties in the fit of the continuum. In that case 
no automatic measurements are made on these objects and BI is 
set to - 1 . 

13. Error on the automatic PCA redshift estimate. If the PCA 
redshift is set to -1, the associated error is also set to -1. 

14. Estimator of the PCA continuum quality (between and 1). 
See Eq.(ll) of Paris et al. (2011). 

15-17. Redshifts measured from C iv, C m] complex and Mg 11 
emission lines from a linear combination of five principal com- 
ponents (see Section|4|i. 

18. Morphological information. If the SDSS photometric 
pipeline classified the image of the quasar as a point source, the 
catalog entry is 0; if the quasar is extended, the catalog entry is 1 . 
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Fig. 21. Left p anel: Distribution o f absorption indices (AI) from our automatic detection (black histogram) and from SDSS-DR3 
(red histogram. ITrump et al.l ll2006). The distributions are normalized for log AI > 3. The difference between the two results at low 
AI is a consequence of slightly different formula used to measure AI. Right panel: Distributio n of balnicity indices (BI) from our 
automatic detection (black histogram) and from SDSS-DR6 (blue histogram, lAUen et alJ[20lTh . 



19-21. Quasars targeted by BOSS are tracked with the 
B OSS-TARGET1 fla g bits (19; see details of selection method 
in iRoss et a D 120121) . In addition, 5% of fibers on each 
plate are dedicated to ancillary programs tracked with the 
ANCILLARY .TARGET 1 (20) and ANCILLARY.TARGET2 
(21) flag bits. Th e bit values and the corresponding program 
names are listed in lDawson et al. I d20l2h . 

22. A quasar known from SDSS-DR7 has an entry equal to 1, 
and otherwise 

23-25. Spectroscopic plate number, Modified Julian Date, and 
spectroscopic fiber number in SDSS-DR7. 

26. Uniform flag. See SectionlTH 

27. The absolute magnitude in the i band at z = 2 cal- 
culated after correction for Galactic extinction and assuming 
Hq = 70 km s _1 Mpc~', Qm = 0.3, Qa = 0.7, and a power-law 
(frequency) continuum i ndex of -0.5. The K-c orrection is com- 
puted using Table 4 from lRichards et all (2006). 

28. The A(g - i) color, which is the difference in the Galactic 
extinction corrected {g - i) for the quasar and that of the mean 
of the quasars at that redshift . If A{g - i) is not defined for the 
quasar, which occurs for objects ateitherz < 0.12orz > 5.12 the 
column will contain "-9. 000". See Section|7]for a description of 
this quantity. 

29. Spectral index a v (see Section r7T2l >. 

30. Median signal-to-noise ratio computed over the whole spec- 
trum. 

31. Median signal-to-noise ratio computed over the window 
1,650-1,750 A in the quasar rest frame. 

32. Median signal-to-noise ratio computed over the window 
2,950-3,050 A in the quasar rest frame. 

33. Median signal-to-noise ratio computed over the window 
5,100-5,250 A in the quasar rest frame. 

34-37. FWHM (kms 1 ), blue and red half widths at half- 
maximum (HWHM; the sum of the latter two equals FWHM), 



and amplitude (in units of the median rms pixel noise, see 
SectionUJi of the C iv emission line. 

38-39. Rest frame equivalent width and corresponding uncer- 
tainty in A of the C iv emission line. 

40-43. Same as 34-37 for the C m] emission complex. It is well 
known that C m]/ll909 is blended with Si m]/ll892 and to a 
lesser extend with Al invll857. We do not attempt to deblend 
these lines. Therefore the redshift and red HFHM derived for 
this blend correspond to C iii]/11909. The blue HFWM is obvi- 
ously affected by the blend 

44-45. Rest frame equivalent width and corresponding uncer- 
tainty in A of the C m] emission complex. 
46-49. Same as 34-37 for the Mg n emission line. 
50-51. Rest frame equivalent width and corresponding uncer- 
tainty in A of the Mg n emission line. 

52. BAL flag from the visual inspection. It is set to 1 if a BAL 
feature was seen during the visual inspection. It is set to oth- 
erwise. Note that BAL quasars are flagged during the visual in- 
spection at any redshift. 

53-54. Balnicity index (BI) for C iv troughs, and its error, ex- 
pressed in km s _1 . See definition in Section IBTTl The Balnicity 
index is measured for quasars with z > 1.57 only. If the BAL 
flag from the visual inspection is set to 1 and the BI is equal to 
0, this means either that there is no C iv trough (but a trough is 
seen in another transition) or that the trough seen during the vi- 
sual inspection does not meet the formal requirement of the BAL 
definition. In cases with bad fits to the continuum, the balnicity 
index and its error are set to - 1 . 

55-56. Absorption index, and its error, for C iv troughs expressed 
in km s -1 . See definition in Section IBTTl In cases with bad con- 
tinuum fit, the absorption index and its error are set to -1. 
57-58. Detection index, and its error, for C iv troughs expressed 
in km s _1 . See definition in Section IBTTl In cases with bad con- 
tinuum fit, the detection index and its error are set to - 1 . 
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59. Following Trump et al. (2006), we calculate the reduced 
X 2 for each trough from Eq. [3] We require that troughs have 
Ximu h > 10 to be considered as true troughs (see Section lBTTT i. 

60. Number of troughs of width larger than 2,000 km s _1 
61-62. Full velocity range over which C iv troughs are at least 
10% below the continuum for troughs of width larger than 
2,000 km s- 1 . 

63. Number of troughs of width larger than 450 km s _1 
64-65. Full velocity range over which C iv troughs are at least 
10% below the continuum for troughs of width larger than 
450^8-'. 

66-68. Rest frame equivalent width in A of Si iv, C iv and Al in 
troughs detected in BAL quasars with BI > 500 km s _1 and 
SNR_1700 > 5. They are set to otherwise or in cases where 
no trough is detected and to -1 if the continuum is not reliable. 
69-70 The SDSS Imaging Run number and the Modified Julian 
Date (MJD) of the photometric observation used in the cata- 
log. The MJD is given as an integer; all observations on a given 
night have the same integer MJD (and, because of the observa- 
tory's location, the same UT date). For example, imaging run 
94 has an MJD of 51075; this observation was taken on 1998 
September 19 (UT). 

71-74 Additional SDSS processing information: the photomet- 
ric processing rerun number; the camera column (1-6) contain- 
ing the image of the object, the field number of the run con- 
taining the object, an d the object identification number (see 
Stoug hton et al.ll2002l for descriptions of these parameters). 
75-84. DR9 flux and errors (not corrected for Galactic extinc- 
tion) in the five SDSS filters. 

85-89. TARGET photometric flux in the five SDSS filters. 
90. Galactic extinc tion in the u band based on the maps of 
ISchleeel et al.l (Il998h . For an Ry = 3.1 absorbing medium, the 
extinctions in the SDSS bands can be expressed as 

A x — C .\ A u 

where x is the filter (u griz), and values of C e . r j 7 are .736, 0.534, 
0.405, and 0.287. See Schlafly & Finkbeiner (201 1) however. 



91. The logarithm of the Galactic neutral hydrogen column den- 
sity along the line of sight to the quasar. These values were 
estima ted via interpolation of the 21 -cm data from Star k et al.l 
(fl992h . using the COLDEN software provided by the Chandra 
X-ray Center. Errors associated with the interpolation are typi- 
cally expected t o be less than « 1 x 10 20 ctrT 2 (e.g., see §5 of 
lElvis et al.lfl994h . 

92. The logarithm of the vignetting-corrected count rate (pho- 
tons s~') in the broad energy band (0.1 -2.4 keV) in the RO SAT 
All-Sky Survey Faint Source Catalog dVoges et al.l l2000[) and 
the R OSAT All-Sky Survey Bright Source Catalog dVoges et al.1 
Il999h . The matching radius was set to 30" (see Section l7.5.11 >: 

93. The SNR of the ROSAT measurement. 

94. Angular Separation between the SDSS and ROSAT All-Sky 
Survey locations (in arcseconds). 

95-98. UV fluxes and errors from GALEX, aperture- 
photometered from the original GALEX images in the two bands 
FUV and NUV (see Section[732]l. 

99- 1 00. The J magnitude and error from the Two Micr on All Sky 
Surve y All-Sky Data Release Point Source Catalog (Cutri et all 
I2003Y using a matching radius of 2.0" (see Section I7.5.3I I. A 
non-detection by 2MASS is indicated by a "0.000" in these 
columns. Note that the 2MASS measurements are Vega-based, 
not AB, magnitudes. 

101-102. SNR in the J band and corresponding 2MASS jr_d flag. 
103-106. Same as 98-101 for the H-band. 



107-110. Same as 98-101 for the #-band. 
111. Angular separation between the SDSS and 2MASS posi- 
tions (in arcseconds). 

112-113. The wl magnitude and error from the W ide-field 

Infrared Survey Explorer (WISE; IWright et al.ll2010h All-Sky 

Data Release Point Source Catalog using a matching radius of 

2" (see Section mU l. 

1 14-1 15 SNR and x 1 in the WISE wl band. 

116-119. Same as 1 1 1-1 14 for the w2-band. 

120-123. Same as 1 1 1-1 14 for the w3-band. 

124-127. Same as 1 1 1-1 14 for the w4-band. 

128. Angular separation between SDSS and WISE positions (in 
arcseconds). 

129. If there is a source in the FIRST catalog (version July 2008) 
within 2.0" of the quasar position, this column contains the 
FIRST peak flux density (see Section r733l An entry of "0.000" 
indicates no match to a FIRST source; an entry of "— 1.000" in- 
dicates that the object does not lie in the region covered by the 
final catalog of the FIRST survey. 

130. The SNR of the FIRST source whose flux is given in col- 
umn 128. 

131. Angular separation between the SDSS and FIRST positions 
(in arcseconds). 



7. Summary of sample 

7.1. Broad view 

The DR9Q catalog contains 87,822 unique, visually confirmed 
quasars, of which 65,205 and 61,931 have, respectively, z > 2 
and z > 2.15. 91% of these quasars were discovered by BOSS. 
The first two years of operations cover an area of approxi- 
mately 3,275 deg 2 leading to a mean density of >15 quasars with 
z > 2.15 per square degree. In the following, we describe the 
properties of the quasar population drawn from the whole sam- 
ple. However, we also provide a uniform flag (see Section f7~4l >. 
A sample of quasars with uniform > Q is a sufficiently statis- 
tical sample for, e.g., c lustering measurements on some scales 
(e.g JWhite et al.ll2012l) and luminosity function demographics. 

Quasars from the present catalog span a range of redshift 
fromz = 0.058 to z = 5.855. The redshift distribution is given 
in Fig. [22] together with that from SDSS-DR7 (red histogram, 
ISchneider et al.ll2~010h . It is apparent from the figure that BOSS 
primarily targets z > 2.15 quasars as it was designed. Only 7,932 
of those quasars were previously known, e.g. detected by previ- 
ous surveys and the majority of those were previous SDSS dis- 
coveries. The DR9Q catalog thus contains about 2.6 times more 
high-redshift quasars than the whole SDSS-I/II survey. The two 
peaks in the redshift distribution at z ~ 0.8 and z ~ 1 .6 are due to 
known degeneracies in the SDSS color space. Six objects have 
z < 0. 1 . These are Seyfert galaxies that were classified as quasars 
in order to differentiate them from normal galaxies. 

Fig. [23] displays the redshift distributions in the redshift 
range of interest for BOSS for the whole sample (black his- 
togram), the CORE sample (red histogram) and the BONUS 
sample (blue histogram). The CORE s ample is selecte d via uni- 
form target selection (see details in iRoss et aTl l2012h . and is 
designed for statistical studies of the quasar population (see 
Section 7.4). On the other hand, the BONUS sample is the re- 
sult of the combination of four target selection algorithms. This 
sample was designed to maximize the number of high-redshift 
quasars. Typical spectra are shown in Fig. [24] 

Table [5] gives the number of objects targeted by the vari- 
ous selection methods and visually inspected (column #2) as de- 
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Fig. 22. The redshift distribution of quasars from this cata- 
log is displayed in black. The same distribution is shown for 
newly discovered quasars only (dashed blue histogram). Most 
of the SDSS-DR9 quasars have a redshift greater than 2. The 
redshift distribution o f quasars from the SDSS-DR7 catalog 
(ISchneider et al.ll20 1 Oh is shown for comparison in red. The lat- 
ter is dominated by quasars at low redshift. The present catalog 
contains 2.6 times more quasars at z > 2.15 than the DR7 cata- 
log. 




Redshift 

Fig. 23. Redshift distribution of SDSS-DR9 quasars in the 
range 2.00-4.00 for the whole distribution (black histogram), the 
CORE sample (red histogram) and the BONUS sample (blue 
histogram). The CORE sample was uniformly selected through 
the likelihood method ( Kirk patrick et al.l|201 ll) duri ng most of 
the fir st year of operation and the XDQSO method (iBovv et al.l 
1201 ll) for the second year. The BONUS sample was selected 
through a combination of four target selection algorithms to 
maximize the number of high-redshift quasars in the sample. 



scribed in Ross et ail d2012l) together with the number of objects 
classified by visual inspection as quasars (column #3), quasars 
with z > 2.15 (column #5), stars (column #6) or galaxies (col- 
umn #7). Column #8 and #9 give, respectively, the number of 
objects with good spectra but uncertain identification and the 
number of objects with data of too low SNR to allow for identi- 
fication (see Section 3 for a detailed description of the different 
categories). Note that a single object can be selected by several 
methods. 

BOSS targets fainter objects than SDSS-I/II. The r-PSF 
magnitude distribution (corrected for Galactic extinction) of 
SDSS-DR9 quasars is shown in Fig. [25] (top panel), and peaks 
at ~20.8. The median signal-to-noise ratio computed over the 
whole spectrum versus the r-PSF magnitude is shown in the bot- 
tom panel of Fig. [25] Percentiles are indicated in grey. The /-PSF 
magnitude distribution of quasar candidates, spectroscopically 
confirmed quasars and z > 2.15 confirmed quasars are shown in 
Fig. [26] There is no drop of the success rate at high magnitude, 
indicating again that the SNR threshold used to define a survey 
quality plate is well chosen. 

Fig. [27] shows the distribution of objects in the redshift- 
luminosity (L versus z) plane for the BOSS survey (black con- 
tours and points) together with the same quantities fo r the SDSS- 
DR7 (red contours and points; Schneid er et al.ll2010r) . We calcu- 
late the absolute /-band (at z - 2) magnitudes, Mi, using the 
observed / -band PSF magnitude s and the K-corrections given in 
Table 4 of Rich ards et alT(l2006l) . This shows the coverage avail- 
able for calculating the evolution of the faint end of the quasar 
luminosity function, and for placing constraints on the luminos- 
ity dependence of quasar clustering (W hite et al]l2012l) . 



Fig. [28] shows the SDSS (u-g), (g-r), (r-/), and (i-z) colors 
as a function of redshift for the DR9Q catalog. Also shown 
are the mean color in r edshift bins (thin red solid line), and 
the models described in iRoss et aTl d2012. in prep J, thick col- 
ored lines). This model is systematically bluer than the data at 
low redshift; BOSS target selection systematically excludes UV- 
excess quasars. The trends with redshift are due to various emis- 
sion lines moving in and out of the SDSS broadband filters, 
and t he onset of the Lyman-q fores t and Lyman-limit systems 



re.g..lFaiJl999LlHennawi et al J2010tlRichards et al.l200"2ir2003l: 
Bow et all 120121: iPeth et al.l 1201 ' I); se e also iProchaska et al.l 
(120091) and lWorseck & Prochaskal d20"TTb for biases in the SDSS 
target selection. 

Fig.l29lshows the SDSS color-color diagrams for the quasars 
in the DR9Q catalog. This figure illustr ates the redshift depen- 
dence of quasar colors (see also Fig. [28] lFarJl999h . The quasars 
at z ~ 2.7 are located in the stellar locus (black contours). 

7.2. Spectral index and composite spectra 

The quasar continuum can be expressed as / cont oc y" e " st , where a v 
is the spectral index. This index is obtained by fitting a power law 
over wavelength ranges outside the Lyman-o' forest and devoid 
of strong emission lines. The regions of the fits are 1450-1500, 
1700-1850 and 1950-2750 A in the rest frame. The continuum 
is iteratively fitted to remove absorption lines and to limit the 
impact of the iron emission blends on the a v measurement. 

The distribution of the quasar spectral index of SDSS-DR7 
quasars re-observed by BOSS is shown in Fig. [31] The median 
spectral index measured for BOSS spectra (black histogram) is 
Ov,dr9 = -0.517 while the median value measured with SDSS- 
DR7 spectra is a v pRi = -0.862. This discrepancy is mainly the 
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Selection 


Maskbits 


# Objects 


#QSO 


#QSOz > 2.15 


# STAR 


# GALAXY 


#? 


# BAD 


BOSS.TARGET1 


QSO.CORE 


10 


3,468 


1,3 


1,084 


1,975 


64 


41 


11 


QSCLBONUS 


11 


4,259 


803 


437 


3,319 


89 


30 


18 


QSOJCNOWNJVIIDZ 


12 


9,927 


9,775 


9,121 


36 


3 


56 


57 


QSO_KNOWN_LOHIZ 


13 


24 


24 

















QSOJSfN 


14 


72,365 


45,319 


34,864 


25,541 


569 


407 


529 


QSCLUKIDSS 


15 


48 


27 


20 


19 


2 








QSOJCDE.COADD 


16 


1,362 


305 


202 


921 


56 


50 


30 


QSOJJKE 


17 


90,762 


54,313 


35,869 


32,909 


1,699 


938 


903 


QSO_FIRST_BOSS 


18 


3,348 


2,507 


1,629 


433 


142 


174 


92 


QSOJCDE 


19 


92,203 


47,564 


34,252 


42,248 


1,021 


647 


723 


QSCLCOREJVIAIN 


40 


67,677 


41,817 


32,355 


23,930 


799 


461 


670 


QSO_BONUS_MAIN 


41 


148,085 


76,660 


53,000 


65,320 


2,931 


1,467 


1705 


QSO_CORE_ED 


42 


22,715 


15,019 


12,387 


7,055 


198 


169 


274 


QSO_CORE_LIKE 


43 


23,951 


17,635 


12,522 


5,591 


319 


188 


218 


QSO_KNOWN_SUPPZ 


44 


24 


24 

















ANCILL ARY.TARGET 1 


QSO.AAL 


22 


174 


172 


1 





1 





1 


QSO.AALS 


23 


281 


277 


2 








1 


3 


QSOJAL 


24 


80 


80 

















QSO_RADIO 


25 


72 


71 











1 





QSO_RADIO_AAL 


26 


58 


58 

















QSO_RADIO.IAL 


27 


31 


30 














1 


QSO.NOAALS 


28 


32 


31 











1 





QSO.GRI 


29 


1,117 


354 


343 


373 


177 


26 


187 


QSO_HIZ 


30 


335 








272 


4 


3 


56 


QSO.RIZ 


31 


728 


47 


42 


545 


78 


11 


47 


ANCILLARY.TARGET2 


HIZQS082 





62 


2 


2 


55 


1 





4 


HIZQSOIR 


1 


28 








25 








3 


KQSO_BOSS 


2 


183 


81 


39 


89 


4 


9 





QSO.VAR 


3 


1,380 


856 


296 


431 


85 


5 


3 


QSO_VAR_FPG 


4 


576 


549 


263 


6 


3 


14 


4 


RADI0.2LOBE_QSO 


5 


332 


149 


15 


131 


29 


7 


16 


QSCLSUPPZ 


7 


208 


208 

















QSO_VAR_SDSS 


8 


1,887 


568 


166 


1,185 


39 


35 


60 



Table 5. Number of visually inspected DR9 BOSS quasar targets (third column) and identifications in the DR9Q catalog for each 
target selection method (first column; see Table 4 of Ross et al. 2012, and Tables 6 and 7 in the Appendix of Dawson et al., 2012). 
These categories overlap because many objects are selected by multiple algorithms. 



consequence of the inaccuracy of the BOSS flux calibration in 
the blue (see Fig. |5). This may explain as well the fact that the 
distribution is more symmetric than previously measured (e.g. 
iRichards et al.ll2003l) lacking the red tail. Therefore the reader 
should be careful of this measurement using BOSS quasar spec- 
tra. 

Although the absolute flux calibration is in error, it is in- 
teresting to compare the composite spectra in different absolute 
magnitude bins. They are displayed in Fig. [32] for the absolute 
magnitude bins -25.0 < M; < -23.5 (magenta), -26.5 < M; < 
-25.0 (blue) and M; < -26.5 (black). The equivalent widths of 
the emission lines decreases with increasing lu minosity. This is 
the well-known Baldwin effect (Bald winlll977h . The rest equiv- 
alent widths of the most important equivalent emission lines is 
given for different absolute magnitude bins in Table [6] 

7.3. Rest equivalent widths in individual spectra 

As explained in Section 4.5, we used five PCA components to fit 
the emission lines and derive their redshift. We used these fits to 
measure also the rest equivalent width and widths (FWHM and 
half widths at half maximum) of the emission lines. The con- 
tinuum is fitted as a power law to the best PCA component fit 
over the windows 1,450-1,470 A and 1,650-1,820 A. We modi- 



Mi [z = 2] 




Restframe equivalent width (A) 
Si iv Civ Cm] Mg n 


-25.0 < M, < 
-26.5 < Mi < 
M; < -26.5 


-23.5 
-25.0 


10.6 65.8 31.3 44.2 
9.5 48.6 27.4 37.0 
8.3 34.0 23.4 29.8 



Table 6. Rest frame equivalent widths measured on the compos- 
ite spectra displayed in Fig. [32] 



fied the windows used by Shen et al. (201 1) (1,445-1,465 A and 
1,700-1,705 A) to minimize the fraction of bad fits, especially 
for emission lines narrower than the mean. 

Fig. [30] shows the comparison betwe en the rest equiva lent 
width measured on SDSS_DR7 spectra bv lShen et all (120 111) and 
that measured on BOSS spectra of the same quasars. Our rest 
equivalent widths are about 10 % smaller on average. This sys- 
tematic shift is likely related to a difference in the rest frame 
wavelength range used to compute the rest frame equivalent 
width. While we strictly limited the equivalent width compu- 
tation to the 1,500-1,600 A range, Shen et al. (2011) used this 
range to fit the line but accounted for the extra wings to estimate 
the rest frame equivalent width. The rms scatter is about 33 %. 
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Fig. 24. First ro w, left: spectrum of the highest redshift quasar (z = 5.855) observed by BOSS ; this quasar was discovered by 
iFan et alj d2004l) : the quasar with highest redshift discovered by BOSS is SDSS J222018.50-010147.0 at z = 5.605. First row, 
right: spectrum of the most luminous (M; [z — 2] = -30.91) quasar available in this catalog. Middle and bottom rows: Four typical 
quasar spectra selected to be representative in terms of SNR at different redshifts (z ~ 2.3, 2.5, 2.7, 3.5). The SNR listed is the 
median SNR per pixel over the whole spectrum. All the spectra were boxcar median smoothed over 5 pixels. 



We checked by hand some of the largest discrepancies and found 
that our procedure seems to behave well. We applied our pro- 
cedure to both SDSS-DR7 data and BOSS data from the same 
quasars. The mean difference is 4% and rms 25%. Statistical er- 
rors are of the orde r of 15%, and variability can account for an- 
other 15% (see e.g. lBentz et dj2009tlwmiite et al.l2006h . BOSS 
spectra are also of better quality. 

The rest equivalent widths are listed in the catalog for C iv, 
the C m] complex and Mg n. A value of -1 indicates that the 
PCA failed to fit the emission line. The variance was computed 
as the integral over the width of the line of the variance in each 
pixel. Note however that errors are mostly due to the position of 
the continuum. 

7.4. Uniform sample 

We provide a similar uniform flag in our catalog to previous 
versions of the SDSS quasar catalogs (e.g., Schneider et al. 
2007). Quasars in our catalog with uniform = 1 are CORE tar- 



gets that were selected with the XDQSO technique (Bovy et al. 
2011) after XDQSO became the CORE targ eting algorithm o f 
choice for BOSS (e.g., in or after Chunk 12: iRoss et al.ll2012h . 
XDQSO will remain the BOSS quasar target algorithm for the 
rest of the survey, so this uniform = 1 sample will grow sig- 
nificantly in subsequent releases. 



Quasars with uniform = 2 would have been selected by 
XDQSO if it had been the CORE algorithm prior to Chunk 
12. uniform = 2 object s are quite compl ete to what XDQSO 
would have selected (e.g jRoss et al.ll2012l) . so uniform > is 
a sufficiently statistica l sample for, e.g., c lustering measurements 
on some scales fe.g.. IWhite et al.l|2012l) . Quasars in our cata- 
log with uniform = are not homogeneously selected CORE 
targets. Finally, the very few (30) quasars in our catalog with 
uni form = - 1 have no chunk information, but are a sufficiently 
small sample to be discarded for the purposes of statistical anal- 
yses. 
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Fig. 28. SDSS colors vs. redshift for quasars in the DR9Q catalog. The thin solid red line is the median color in bins of redshift. 
The t hick color lines are mod els from simulations used to determine the BOSS quasar completeness (McGreer et al., in prep.; see 
also iRoss et al in prep.h for three different quasar luminosities: M,[z = 2] - -22.49 (cyan ), M,[z = 2] = -24.99 (green) 

and Mi[z = 2] = -27.49 (orang e); and empirical tracks for the DR7 quasars (blue Bo w et al]|201 lb . These simulations include the 
Baldwin effect (Ba ldwinll 1977b . Therefore the colors depend on the quasar luminosity. The model is systematically bluer than the 
data at low redshift because BOSS systematically excludes UV-excess sources. 



7.5. Multiwavelength matching 
7.5.1 . ROSAT all sky survey 

We cross-correlate the DR9Q catalog with the ROSAT all sky 
survey catalogues listing the sources detected in the energy band 
0.1-2.4 keV. The matching radius is set to 30". 

We report the logarithm of the vignetting-corrected count 
rate (phot ons s" 1 ) from the R OSAT All-Sky Survey Faint Source 
Catalog dVoges et all 120001) and the R OSAT All-Sky Survey 
Bright Source Catalog dVoges et al.ll999l) . An entry of "-9.000" 
in the column RASS_COUNTS indicates no X-ray detection. We 
also report the SNR at the position of the quasar and the separa- 
tion between the quasar and the X-ray source. 

There are 16 matches with the Bright Source Catalog and 
298 with the Faint Source Catalog. It never happened to find 
more than one source within the matching radius. No DR9 
quasar is detected both in the Bright and Faint Source cata- 
logues. Only the most reliable detections were included in our 
catalog: X-ray counterparts for 13 quasars were flagged for pos- 



sible detection quality issues and therefore are not included in 
the present quasar catalog. 

7.5.2. The Galaxy Evolution Explorer (GALEX) 

The GALEX space mission (Mar tin et al.ll2005l) has performed 
an all-sky imaging survey in two UV bands (FUV: 1350 to 1750 
A ; NUV: 1750 to 2750 A) down to m AB ~ 20.5 a nd a medium- 
deep imaging survey that reaches otab ~ 23 (e.g.. lBianchi et all 
Both surveys are used here. 

GALEX images are force photometering GALEX images 
(from GALEX Data Release 5) at the SDSS-DR8 centroids 
(Aihara et al. 2011), such that low signal-to-noise point-spread 
function (PSF) fluxes of objects not detected by GALEX is ob- 
tained. 

A total of 77,236 quasars lie in the GALEX FUV footprint, 
78,062 lie in the NUV footprint, and 77,197 are covered by both 
bandpasses. 
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Fig. 29. Color-color diagrams for all quasars in the DR9Q catalog. Colors of points encode their redshifts (see top right panel). The 
stellar locus is represented with black contours. 



7.5.3. The two micron all sky survey (2MASS) 

We cross-correlate the DR9Q catalog with the All-Sky Data 
Release Point Source Vatalog (Cutri et al., 2003) using a match- 
ing radius of 2.0". 

Together with the Vega magnitudes in the J, H and K-bands 
(xMAG with x = J, H or K) and their errors (ERR_xMAG), 
we report the SNR (xSNR) since the errors on the magni- 
tude do not differentiate the 2cr upper limits (in a 4" ra- 
dius aperture) from detections. We also give for each band 
the value of the 2MASS flag rd_flg[l] (entry xRDFLAG) 
which gives the meaning of the peculiar values of xMAG and 
ERR_xMAG (see http://www.ipac.caltech.edu/2mass/releases/- 
allsky/doc/explsup.html) 

There are 1,441 matches in the catalog. 

7.5.4. The Wide-Field Infrared Survey (WISE) 

We take the DR 9Q catalog, and ma tch to the Wide-Field Infrared 
Survey (WISE; IWright et al.ll2010h All-Sky Data Releasfl ask- 
ing for all quasars in the DR9Q catalog that are in the All-Sky 
Source Catalog. WISE photometry covers four bands, 3.4, 4.6, 
12 and 22 /mi, where the angular resolution of WISE is «6" 

5 http://wise2.ipac.caltech.edu/docs/release/allsky/ 



for 3.4, 4.6, 12 /an, and «12" for 22 /mi dWright et alJ feOlO). 
After testing for various matching radii (1", 2", 6", 12", 18"), 
we use a matching radius of 2.0", and a total of 45,987 rows of 
WISE photometry data are returned, along with the separation 
in arcseconds between the SDSS and WISE source (stored in 
SDSSWISE_SEP). 

In the DR9Q catalog, we report the photometric quan- 
tities, wxmpro, wxsigmpro, wxsnr, wxrchi2, where x 
I 4 and represents the four WISE bands centered at wave- 
lengths of 3.4, 4.6, 12 and 22 /mi. These magnitudes are 
in the Vega system, and are measured with profile-fitting 
photometry (see e.g. http://wise2.ipac.caltech.edu/docs/release/- 
allsky/expsup/sec2J2a.html and http://wise2.ipac.caltech.edu/- 
docs/release/ allsky/expsup/ sec4_4c . html#wpro) . 

Formulae for converting WISE Vega magnitudes to 
flux dens ity units (in Jansk ys) a nd AB ma g nitude s are 
given in Wright et aT| (1201 Oh and Qarrett et aD (120111) and 
also here: http://wise2.ipac.caltech.edu/docs/release/- all- 
sky/expsup/ sec4_4h. html#conv2flux 

Although the MIR WISE properties of the BOSS quasars 
will be valuable for many scientific questions, we strongly urge 
the user to not only consider the various "health warnings" asso- 
ciated with using the BOSS quasar dataset (as given in Section 
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Fig. 25. Top panel: Distribution of r magnitude of SDSS-DR9 
quasars (PSF; corrected for Galactic extinction). Bottom panel: 
Median SNR per pixel over the whole spectrum with respect to 
the r-PSF magnitude (black histogram). Percentiles are indicated 
in grey. 



2) but also those connected to the WISE All-Sky Release Data 
Product^. 

We do not investigate any of the 2MASS, or UKIDSS prop- 
erties associat ed with WISE matches he re, but these are being 
investigated in Ross et al] (12012. in prep.h . 

7.5.5. FIRST 

We cross-correlate the DR9Q catalog with objects that are de- 
tected in the FIRST radio survey (Becker et al. 1995). We use 
the version of July 2008. 

If there is a source in the FIRST catalog within 2.0" of the 
quasar position, we indicate the FIRST peak flux density and the 
SNR. Note that extended radio sources may be missed by this 
matching. 
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Fig. 26. Fraction of quasar candidates confirmed as quasars (red 
histogram) and z > 2.15 quasars (blue histogram) versus the i- 
band PSF magnitude (corrected for Galactic extinction). 
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redshift 

Fig. 27. L - z plane for SDSS-DR9 quasars (black con- 
tours and points) and SDSS-D R7 quasars (red contours 
and points; Schne ider et al.l l2010h . The luminosity assumes 
Hp _ 70 km s~~' M pc -1 and the K-correction is given by 
Rich ards et al.l d2006l) who consider K(z — 2) = 0. Contours 
are drawn at constant point density. 



Note that, as in SDSS-I/II, FIRST sources are automatically 
included in the target selection. An additional cut in color (u- 
g > 0.4) is added to av oid as much as possible low-redshift 
sources dRoss et al.ll2012l) . The catalog contains 3,283 FIRST 
matches. 



8. Additional quasars 

We provide a supplemental list of 949 quasars, of which 318 
at z > 2.15, that have been identified among quasar targets af- 
ter DR9 was "frozen" (Section 18. Il l or among galaxy targets 
(Section [8.2b . This supplemental list of quasars is provided in 
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Fig. 30. Rest frame equivalent width of the C iv emission line 
of SDSS-DR7 qu asars, measured from the SDSS-DR7 spectra 
dShen etal.ll2011h and spectra obtained by BOSS (this work). 



Restframe wavelength (A) 

Fig. 32. Composite spectra of BOSS quasars in different ranges 
of absolute magnitude: -25.0 < M; < -23.5 (magenta), -26.5 < 
Mi < -25.0 (blue) and M\ < -26.5 (black). All the spectra were 
normalized to have a flux unity near 1450 A in the quasar rest 
frame. The Baldwin effect is apparent (see Table 6). 
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Spectral index oc v 

Fig. 31. Distribution of the spectral index a v of z > 2 of SDSS- 
DR7 quasars (red histogram) re-observed by BOSS (black his- 
togram). The spectral index was measured using the rest frame 
wavelength ranges 1450-1500, 1700-1850 and 1950-2750 A. As 
seen already from the composite spectrum shown in Fig. [5] the 
spectral indices measured using SDSS-DR9 quasar spectra are 
bluer than those obtained using DR7 spectra (see Section 7.2). 



the same format as the DR9Q catalog but in a separate file 
and is meant to be merged with the whole catalog for DR10. 
Fig.[33]gives the redshift distribution of these additional quasars. 
The list is available together with the DR9Q catalog and the 
list of objects classified as QS0_? at the SDSS public website 
http://www.sdss3.org/dr9/algorithms/qsoxatalog.php 



8.1. Additional quasars from the quasar target list 

The quasar catalog was frozerQin February 2012, but we subse- 
quently identified an additional 301 quasars (294 with z > 2.15) 
that have been targeted as quasar candidates. Some of these were 
identified with improvements of the pipeline. Others are identi- 
fied from a good spectrum taken on a plate which was not survey 
quality so was not included in the first inspection. A handful are 
objects that have been misidentified during the first inspection 
but were corrected during the checks. 

In addition, a few ancillary programs were not included in 
the first inspection. Therefore the supplemental list contains the 
quasars that have been targeted only by these programs. These 
comprise 500 quasars of which 20 have z > 2.15. 

Finally, only objects classified as QS0 or QS0_BAL are listed 
in the official DR9Q catalog. Objects classified as QS0_Z? (126 
in total; 122 corresponding to the DR9Q inspection) are also in- 
cluded in the supplemental list. Most of the latter are very pecu- 
liar BAL quasars. 

8.2. Galaxy targets 

In order to be as complete as possible we also tried to identify 
serendipitous quasars. For this, we visually inspected all objects 
from the BOSS galaxy target list that the pipeline reliably classi- 
fies (ZWARNING = 0) as quasars with z > 2, and all objects clas- 
sified as GALAXY/BROADLINE. We also visually inspected 10% of 
the galaxy targets classified as quasars with ZWARNING not equal 
to zero; none were in fact quasars, so we did not inspect the 
remaining such objects. A large fraction (65%) of the unclassifi- 
able objects are attributed to the QS0 class, but with low signifi- 
cance. They include all sorts of unidentified objects and spectra 
with calibration problems but probably very few real quasars, if 
any. In total we identified 22 additional quasars, 4 of which were 
at z > 2.15, out of more than 3,000 targets. There is one quasar 
classified as QS0_Z?. 



7 By which we mean no additional quasar or change in the identifi- 
cations were intended to be included in the catalog 
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Fig. 33. Redshift distribution of the 949 additional quasars de- 
scribed in Section[8] 

9. Conclusion 

The quasar catalog presented here contains 87,822 quasars, 
61,931 having z > 2.15, with robust identification from visual 
inspection and redshift derived from the fit of PCA components 
to the spectra. This catalog has been gathered during the first two 
years of the BOSS operation covering 3,275 deg 2 . It will be the 
basis for studies of the luminosity function and the spatial distri- 
bution of quasars as well as studies of the clustering properties 
of the Lyman-ff forest. In particular it will be used to measure 
for the first time the BAO clustering signal in the IGM at z ~ 2.3 
from the Lyman-a forest. 

For quasars with z em > 1.57, the catalog identifies 7,228 
broad absorption line quasars from visual inspection, of which 
3,130 have BI > 500 km s" 1 . In the 7,317 spectra with 
SNR > 10, we find 813 BALs with BI > 500 km s 1 corre- 
sponding to a fraction of 11.1%. We implement a procedure to 
identify BALs automatically, fitting the quasar continuum with 
PCAs. 3,330 BALs with BI > 500 km s _1 have been identified in 
this way, of which 821 in spectra of SNR > 10. The catalog gives 
their characteristics, balnicity and absorption indices. The list of 
BALs will be used for statistical analysis of this population of 
quasars. Since SDSS-DR7 z > 2.15 quasars are reobserved by 
BOSS, this will be a unique opportunity to study the variability 
of these troughs. High redshift (z > 2) quasar continua together 
with pixel masks, improved noise estimates, and other products 
designed t o aid in the BAO-Lyman-g clustering analysis will be 
released in lLee et all (12012. in prepJ) . 

BOSS is a five year program and the next version of our 
quasar catalog, to be released as a part of SDSS-DR10 in July 
2013, should contain about two times as many quasars as the 
DR9Q catalog. Improvements in the pipeline will allow us to 
achieve identification of more objects. We will also perform mul- 
tiple checks and improve our procedures in order to place better 
constraints on the characterisitcs of the quasar spectra. 
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24 


MJD.DR7 


16 


SDSS-DR7 spectroscopic MJD if the quasar is known from DR7 


25 


FIBERIDJDR7 


14 


SDSS-FR7 spectroscopic Fiber number if the quasar is known from DR7 


26 


UNIFORM 


12 


Uniform sample flag 


27 


MI 


F9.4 


Mj [z = 2] (h = 70km s~' Mpc" 1 , Q. M = 0.3, £1 A = 0.7, a v = -0.5) 


28 


DGMI 


F9.4 


A(g - f) = (g - f) - ((g - OXedshift (Galactic extinction corrected) 


29 


ALPHAJSfU 


F9.4 


Spectral index measurement a v 


30 


SNRJSPEC 


F9.4 


Median signal-to-noise ratio over the whole spectrum 


31 


SNR.1700 


F9.4 


Median signal-to-noise ratio in the window 1,650 - 1,750A (rest frame) 


32 


SNR.3000 


F9.4 


Median signal-to-noise ratio in the window 2,950 - 3,050A (rest frame) 


33 


SNR.5150 


F9.4 


Median signal-to-noise ratio in the window 5,100 - 5,250A (rest frame) 


34 


FWHM.CIV 


F9.4 


FWHM of C iv emission line in km s~' 


35 


BHWHM.CIV 


F9.4 


Blue HWHM of C iv emission line in km s~' 


36 


RHWHM.CIV 


F9.4 


Red HWHM of C iv emission line in km s~' 


37 


AMPXIV 


F9.4 


Amplitude of C iv emission line in units of median rms pixel noise 


38 


REWE.CIV 


F9.4 


Rest frame equivalent width of C iv emission line in A 


39 


ERR_REWE_CIV 


F9.4 


Uncertainty on the rest frame equivalent width of C iv emission line in A 


40 


FWHM.CIII 


F9.4 


FWHM of C m] emission complex in km s -1 


41 


BHWHM.CIII 


F9.4 


Blue HWHM of C m] emission line in km s~' 


42 


RHWHM.CIII 


F9.4 


Red HWHM of C m] emission line in km s~' 


43 


AMP.CIII 


F9.4 


Amplitude of C m] emission complex in units of median rms pixel noise 


44 


REWE.CIII 


F9.4 


Rest frame equivalent width of C m] emission line in A 


45 


ERR_REWE_CIII 


F9.4 


Uncertainty on the rest frame equivalent width of C m] emission complex in A 


46 


FWHM.MGII 


F9.4 


FWHM of Mg ii emission line in km s~' 


47 


BHWHM.MGII 


F9.4 


Blue HWHM of Mg n emission line in km s~' 


48 


RHWHM.MGII 


F9.4 


Red HWHM of Mg n emission line in km s" 1 


49 


AMPJVIGII 


F9.4 


Amplitude of Mg n emission line in units of median rms pixel noise 


50 


REWEJVIGII 


F9.4 


Rest frame equivalent width of Mg n emission line in A 


51 


ERR_REWEJVIGII 


F9.4 


Uncertainty on the rest frame equivalent width of Mg n emission in A 


52 


BAL_FLAG_VI 


12 


BAL flag from visual inspection 


53 


BLCIV 


F9.4 


Balnicity index of C iv trough in km s ^ 


54 


ERR_BI_CIV 


F9.4 


Error on the Balnicity index of C iv trough in km s 


55 


ALCIV 


F9.4 


Absorption index of C iv trough in km s 


56 


ERR_AI_CIV 


F9.4 


Error on the absorption index of C iv trough in km s 


57 


DLCIV 


F9.4 


Detection index of C iv trough in km s 


58 


ERRJDLCIV 


F9.4 


Error on the detection index of C iv trough in km s~' 


59 


CHI2THROUGH 


F9.4 


X 2 of the trough from Eq.[3] 


60 


NCIV.2000 


13 


Number of distinct C iv troughs of width larger than 2,000 km s -1 


61 


VMIN.CIV.2000 


F9.4 


Minimum velocity of the C iv troughs defined in row 60 km s~' 


62 


VMAX_CIV_2000 


F9.4 


Maximum velocity of the C iv troughs defined in row 60 in km s~' 


63 


NCIV.450 


13 


Number of distinct C iv troughs of width larger than 450 km s~' 


64 


VMIN.CIV.450 


F9.4 


Minimum velocity of the C iv troughs defined in row 63 in km s" 1 


65 


VMAX_CIV_450 


F9.4 


Maximum velocity of the C iv troughs defined in row 63 in km sr l 
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Table 4. continued. 



Column 


Name 


Format 


Description 


66 


REW_SIIV 


F9.4 


rest frame equivalent width of the Si iv trough 


67 


REW.CIV 


F9.4 


rest frame equivalent width of the C iv trough 


68 


REW_ALIII 


F9.4 


rest frame equivalent width of the Al in trough 


69 


RUN .NUMBER 


16 


SDSS Imaging Run Number of photometric measurements 


70 


PHOTO JVIJD 


16 


Modified Julian Date of imaging observation 


71 


RERUN .NUMBER 


A4 


SDSS Photometric Processing Rerun Number 


72 


COL.NUMBER 


12 


SDSS Camera Column Number (1-6) 


73 


FIELD .NUMBER 


15 


SDSS Field Number 


74 


OBJ.ID 


A20 


SDSS Object Identification Number 


75 


UFLUX 


F9.4 


flux in the w-band (not corrected for Galactic extinction) 


76 


ERR.UFLUX 


F9.4 


Error in u flux 


77 


GFLUX 


F9.4 


flux in the g-band (not corrected for Galactic extinction) 


78 


ERR.GFLUX 


F9.4 


Error in g flux 


79 


RFLUX 


F9.4 


flux in the r-band (not corrected for Galactic extinction) 


80 


ERR.RFLUX 


F9.4 


Error in v flux 


81 


IFLUX 


F9.4 


flux in the z-band (not corrected for Galactic extinction) 


82 


ERRJFLUX 


F9.4 


Error in i flux 


83 


ZFLUX 


F9.4 


flux in the z-band (not corrected for Galactic extinction) 


84 


ERR.ZFLUX 


F9.4 


Error in z flux 


85 


TARGET.UFLUX 


F9.4 


TARGET flux in the «-band (not corrected for galactic extinction) 


86 


TARGET.GFLUX 


F9.4 


TARGET flux in the g-band (not corrected for galactic extinction) 


87 


TARGET .RFLUX 


F9.4 


TARGET flux in the r-band (not corrected for galactic extinction) 


88 


TARGET JFLUX 


F9.4 


TARGET flux in the z-band (not corrected for galactic extinction) 


89 


TARGET.ZFLUX 


F9.4 


TARGET flux in the z-band (not corrected for galactic extinction) 


90 


U.EXT 


F9.4 


zz band Galactic extinction (from (Schlegel et al. 1998)) 


91 


HI.GAL 


F9.4 


logWH (logarithm of Galactic H i column density in cm~ 2 ) 


92 


RASS.COUNTS 


F9.4 


log RASS full band count rate (counts s _1 ) 


93 


RASS.COUNTS.SNR 


F9.4 


SNR of the RASS count rate 


94 


SDSS2ROSAT.SEP 


F9.4 


SDSS-RASS separation in arcsec 


95 


NUVFLUX 


F9.4 


nuv flux (GALEX) 


96 


ERR.NUVFLUX 


F9.4 


Error in nuv flux 


97 


FUVFLUX 


F9.4 


fuv flux (GALEX) 


98 


ERR.FUVFLUX 


F9.4 


Error in fuv flux 


99 


JMAG 


F9.4 


J magnitude (Vega, 2MASS) 


100 


ERRJMAG 


F9.4 


Error in / magnitude 


101 


JSNR 


F9.4 


J-band SNR 


102 


JRDFLAG 


12 


J-band photometry flag 


103 


HMAG 


F9.4 


H magnitude (Vega, 2MASS) 


104 


ERR.HMAG 


F9.4 


Error in H magnitude 


105 


HSNR 


F9.4 


H-band SNR 


106 


HRDFLAG 


12 


H-band photometry flag 


107 


KMAG 


F10.6 


K magnitude (Vega, 2MASS) 


108 


ERR.KMAG 


F10.6 


Error in K magnitude 


109 


KSNR 


F10.6 


K-band SNR 


110 


KRDFLAG 


12 


K-band photometry flag 


111 


SDSS2MASS.SEP 


F10.6 


SDSS-2MASS separation in arcsec 


112 


W1MAG 


F10.6 


w 1 magnitude (Vega, WISE) 


113 


ERR.W1MAG 


F10.6 


Error in w 1 magnitude 


114 


W1SNR 


F10.6 


SNR in wl band 


115 


W1CHI2 


F10.6 


y~ in wl band 


116 


W2MAG 


F10.6 


wl magnitude (Vega, WISE) 


117 


ERR.W2MAG 


F10.6 


Error in w2 magnitude 


118 


W2SNR 


F10.6 


SNR in wl band 


119 


W2CHI2 


F10.6 


X 2 in wl band 


120 


W3MAG 


F10.6 


w3 magnitude (Vega, WISE) 


121 


ERR.W3MAG 


F10.6 


Error in w3 magnitude 


122 


W3SNR 


F10.6 


SNR in wl band 


123 


W3CHI2 


F10.6 


X 2 in wl band 


124 


W4MAG 


F10.6 


w4 magnitude (Vega, WISE) 


125 


ERR.W4MAG 


F10.6 


Error in w4 magnitude 


126 


W4SNR 


F10.6 


SNR in wl band 


127 


W4CHI2 


F10.6 


X 2 in wl band 


128 


SDSS2WISE.SEP 


F10.6 


SDSS-WISE separation in arcsec 


129 


FIRST .FLUX 


F10.6 


FIRST peak flux density at 20 cm expressed in mJy 


130 


FIRST.SNR 


F10.6 


SNR of the FIRST flux density 


131 


SDSS2FIRST.SEP 


F10.6 


SDSS-FIRST separation in arcsec 



" All magnitudes are PSF magnitudes 
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